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cells can be modulated by CCI-779 or other drugs. Therefore, these genes can be used as surrogate markers for monitoring drug 
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METHODS FOR MONITORING DRUG ACTIVITIES. IN VIVO 

[0001] This application incorporates by reference the entire disclosures of US. 

Provisional Application Serial No. 60/446,133, filed February 11, 2003 and entitled 
'Methods for Monitoring Drug Activities in Vivo" U.S. Provisional Application Serial No. 
60/459,782, filed April 3, 2003 and entitled "Methods for Diagnosing RCC and/or Solid 
Tumors," and U.S. Provisional Application, filed January 23, 2004 and entitled "Methods 
for Prognosis and Treatment of Solid Tumors" (by Michael Burczynski, et al). In addition, 
this application incorporates by reference all materials recorded in compact discs labeled 
"Copy 1 - Sequence Listing Part," "Copy 2 - Sequence Listing Part" and "Copy 3 - 
Sequence Listing Part," each of which includes "Sequence Listing.ST25.txf' (3,647 KB, 
created February 9, 2004). Furthermore, this application incorporates by reference all 
materials recorded in compact discs labeled "Copy 1 - Tables Part," "Copy 2 - Tables Part" 
and "Copy 3 - Tables Part," each of which includes "Table 1 (the Qualifier Table).txt" (210 
KB, created February 9, 2004). 

TECHNICAL FIELD 

[0002] This invention relates to methods, systems and equipment useful for 

monitoring in vivo activities of CCI-779 or other drugs. 

BACKGROUND 

[0003] CCI-779 is an ester analog of the immunosuppressant rapamycin and as such 

is a potent, selective inhibitor of the mammalian target of rapamycin. The mammalian 
target of rapamycin (mTOR) activates multiple signaling pathways, including 
phosphorylation of p70s6kinase, which results in increased translation of 5' TOP mRNAs 
encoding proteins involved in translation and entry into the Gl phase of the cell cycle. By 
virtue of its inhibitory effects on mTOR and cell cycle control, CCI-779 may function as a 
cytostatic and immunosuppressive agent CCI-779 has been used as an anticancer drug and 
is currently being evaluated for indications in clinical trials for treating various oncology 
and inflammatory diseases. These diseases include, but are not limited to, renal cell 
carcinoma (RCC), prostate cancer, gliomas, lung cancer, and non-Hodgkin's lymphoma. 
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SUMMARY OF THE INVENTION 

[0004] One of the main objectives of clinical pharmacogenomic studies is to identify 

suitable markers for monitoring in vivo activities of CCI-779 or other drugs. The present 
invention employs easily-obtained tissues, such as peripheral blood, as surrogate tissues for 
the detection of in vivo activities of CCI-779 or other drugs. 

[0005] In one aspect, the present invention provides methods that are useful for 

detecting in vivo activities of CCI-779 or other drugs. The methods include comparing an 
expression profile of at least one drug activity gene in a peripheral blood sample of a patient 
of interest to a reference expression profile of the gene, where the gene is differentially 
expressed in peripheral blood mononuclear cells (PBMCs) of patients who have a non- 
blood disease and are subject to a drug therapy as compared to PBMCs isolated from the 
patients prior to the drug therapy. In many embodiments, the patient of interest has the non- 
blood disease and is being treated by the drug therapy. 

[0006] In one embodiment, the drug therapy is an anti-cancer therapy, such as CCI- 

779 therapy, and the non-blood disease is a solid tumor, such as RCC, prostate cancer, or 
head/neck cancer. In another embodiment, the drug activity genes of the present invention 
are selected from Table 5. 

[0007] The peripheral blood samples used in the present invention can be, without 

limitation, whole blood samples or samples comprising enriched PBMCs. Other peripheral 
blood samples that include PBMCs can also be employed in the present invention. 
[0008] The expression profiles of the drug activity genes can be determined by 

various means, such as quantitative RT-PCR, Northern Blot, in situ hybridization, slot- 
blotting, nuclease protection assay, nucleic acid arrays, enzyme-linked immunosorbent 
assays (ELISAs), radioimmunoassay (RIAs), fluorescence-activated cell sorters (FACSs), or 
Western Blots. In addition, two-dimensional SDS-polyacrylamide gel electrophoresis and 
other high-through nucleic acid or protein detection techniques can also be used. 
[0009] In many embodiments, the reference expression profile and the expression 

profile being compared are prepared using the same or comparable methodology. In one 
example, the reference expression profile is an average baseline expression profile of drug 
activity genes in peripheral blood samples isolated from patient or patients prior to a drug 
treatment In another example, the reference expression profile is an expression profile of 
drug activity genes in peripheral blood samples of the patient of interest. The reference 
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expression profile and the expression profile being compared can be obtained from 
peripheral blood samples isolated at different time points in a drug treatment 
[0010] In one embodiment, the drug activity genes used in the present invention are 

over-expressed (or under-expressed) in PBMCs of patient who have a non-blood disease as 
compared to PBMCs of humans who do not have the non-blood disease. The drug therapy 
being investigated can down-regulate (or up-regulate) the expression of the drug activity 
genes in PBMCs of patients who have the non-blood disease. 

[0011] In another embodiment, the expression of the drug activity genes in PBMCs 

can be stimulated (or suppressed) by phytohemagglutinin (PHA). The drug therapy being 
investigated can down-regulate (or up-regulate) the expression of these genes in PHA- 
treated PBMCs. 

[0012] La another aspect, the present invention provides other methods that are 

useful for detecting in vivo activities of CCI-779 or other drugs. The methods include 
comprises comparing an expression profile of at least one drug activity gene in a peripheral 
blood sample of a patient of interest to a reference expression profile of the gene, where the 
RNA transcripts) of the drug activity gene can hybridize under stringent or nucleic acid 
array hybridization conditions to one or more qualifiers selected from the Qualifier Table. 
[0013] In yet another aspect, the present invention provides methods useful for 

identifying drug activity genes. The methods include detecting the gene expression profite 
in peripheral blood samples of patients who have a non-blood disease and are subject to a 
drug therapy, and comparing the gene expression profile to a baseline gene expression 
profile in peripheral blood samples isolated before the drug therapy. Drug activity genes 
whose expression levels in peripheral blood samples can be modulated by the drug therapy 
can therefore be identified. 

[0014] In still another aspect, the present invention provides kits useful for detecting 

in vivo activities of CCI-779 or other drugs. In one embodiment, the kits include a plurality 
of polynucleotides, and each polynucleotide can hybridize under stringent or nucleic acid 
array hybridization conditions to an RNA transcript, or the complement thereof, of a 
different respective drug activity gene. In another embodiment, the kits include a plurality 
of antibodies, and each antibody can bind to a polypeptide encoded by a different respective 
drug activity gene. The drug activity genes can be selected, without limitation, from Table 
5. 
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[0015] . In still yet another aspect, the present invention provides nucleic acid arrays 
useful for detecting in vivo activities of CCI-779 or other drugs. A substantial portion of all 
polypeptide probes on the nucleic acid array can hybridize under stringent or nucleic acid 
array hybridization conditions to RNA transcripts, or the complements thereof, of drug 
activity genes. 

[0016] Other features, objects, and advantages of the present invention are apparent 

in the detailed description that follows. It should be understood, however, that the detailed 
description, while indicating preferred embodiments of the invention, are given by way of 
illustration only, not limitation. Various changes and modifications within the scope of the 
invention will become apparent to those skilled in the art from the detailed description. 

DETAILED DESCRIPTION 

[0017] The present invention provides methods useful for the detection of in vivo 

activities of CCI-779 or other drugs. Numerous drug activity genes can be identified by the 
present invention. The expression profiles of these genes in PBMCs can be modulated by 
CCI-779 or other drugs. Accordingly, these genes can be used as surrogate markers for 
monitoring drug activities in vivo. In one embodiment, the methods of the present invention 
include comparing the expression profile of at least one drug activity gene in a peripheral 
blood sample of a patient of interest to a reference expression profile of the same drag 
activity gene. The patient of interest has a non-blood disease, such as RCC, prostate cancer, 
or another solid tumor, and is being treated by a drug therapy. A change in the peripheral 
blood expression profile of the drug activity gene is indicative of the in vivo activity of the 
drug therapy. In many cases, the reference expression profile can be determined by using 
baseline peripheral blood samples isolated from patients prior to the drug therapy. 
Peripheral blood samples amenable to the present invention include, but are not limited to, 
whole blood samples or samples comprising enriched PBMCs. Expression profiles of drug 
activity genes can be detected using a variety of methods, such as quantitative RT-PCT, 
Northern Blot, in situ hybridization, nucleic acid arrays, enzyme-linked immunosorbent 
assay (ELISA), radioimmunoassay (RIA), FACS (fluorescence-activated cell sorter), or 
Western Blot The drug activity genes of the present invention may also be used for 
assessing the efficiency of a drug therapy. 

[0018] Various aspects of the invention are described in further detail in the 

following sections. The use of sections is not meant to limit the invention. Each section 
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and subsection may apply to any aspect of the invention. In this application, the use of "or" 
means "and/or" unless stated otherwise. Also, the use of the singular includes the plural 
unless stated otherwise. 

A. General Methods for Identifying Drug Activity Genes 

[0019] The availability of the human genome sequence, together with new 

developments in technology, such as DNA microarrays, proteomics and computational 
biology, allows systemic gene expression studies for various diseases. The present 
invention employs the systematic gene expression analysis technique to identify genes 
whose expression in peripheral blood can be modulated by a therapeutic agent such as CCI- 
779. These genes are herein referred to as "drug activity genes." The genes whose 
expression levels in peripheral blood can be modified by CCI-779 are refereed to as "CCI- 
779 activity genes." 

[0020] Drug activity genes can be identified by comparing peripheral blood gene 

expression profiles before and after a drug treatment. Numerous methods are available for 
detecting and comparing gene expression profiles. 

[0021] For instance, gene expression profiles can be detected by measuring the 

levels of RNA transcripts in peripheral blood samples. In one embodiment, total RNAs or 
polyA + RNAs are isolated from peripheral blood samples using conventional means. The 
isolated RNAs can be amplified to produce cDNAs or cRNAs. Peripheral blood gene 
expression profiles can be determined by measuring the amount of the amplified cDNAs or 
cRNAs. 

[0022] Peripheral blood gene expression profiles can also be determined by 

measuring the levels of polypeptides in peripheral blood samples. The amounts of 
polypeptides in peripheral samples can be detected using various methods, such as ELISAs, 
RIAs, FACSs, Western Blots or other immunoassays. In addition, two-dimensional gel 
electrophoresis/mass spectrometry or other high-throughput protein sequencing and 
identification methods can be used. 

[0023] In one embodiment, nucleic acid arrays are used for detecting or comparing 

gene expression profiles in peripheral blood samples isolated at different stages of a 
therapeutic treatment Nucleic acid arrays allow for quantitative detection of the expression 
levels of a large number of genes at one time. Examples of nucleic acid arrays include, but 
are not limited to, Genechip® microarrays from Affymetrix (Santa Clara, CA), cDNA 
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microarrays from Agilent Technologies (Palo Alto, CA), and bead arrays described in U.S. 
Patent Nos. 6,288,220 and 6,391,562. 

[0024] The polynucleotides to be hybridized to nucleic acid arrays can be labeled 

with one or more labeling moieties to allow for detection of hybridized polynucleotide 
complexes. The labeling moieties can include compositions that are detectable by 
spectroscopic, photochemical, biochemical, bioelectronic, immunochemical, electrical, 
optical or chemical means. Exemplary labeling moieties include radioisotopes, 
chemiluminescent compounds, labeled binding proleins, heavy metal atoms, spectroscopic 
markers such as fluorescent markers and dyes, magnetic labels, linked enzymes, mass 
spectrometry tags, spin labels, electron transfer donors and acceptors, and the like. 
Unlabeled polynucleotides can also be employed. The polynucleotides can be DNA, RNA, 
or a modified form thereof. 

[0025] Hybridization reactions can be performed in absolute or differential 

hybridization formats. In the absolute hybridization format, polynucleotides derived from 
one sample, such as a peripheral blood sample isolated from a cancer patient at a particular 
treatment stage, are hybridized to the probes in a nucleic acid array. Signals detected after 
the formation of hybridization complexes correlate to the polynucleotide levels in the 
sample. In the differential hybridization format, polynucleotides derived from two 
biological samples, such as one isolated from a cancer patient at a first stage of treatment 
and the other isolated from the same patient but at a second stage of treatment, are labeled 
with different labeling moieties. A mixture of these differently labeled polynucleotides is 
added to a nucleic acid array. The nucleic acid array is then examined under conditions in 
which the emissions from the two different labels are individually detectable. In one 
embodiment, the fluorophores Cy3 and Cy5 (Amersham Pharmacia Biotech, Piscataway 
N.J.) are used as the labeling moieties for the differential hybridization format. 
[0026] Signals gathered from nucleic acid arrays can be analyzed using 

commercially available software, such as those provide by Affymetrix or Agilent 
Technologies. Controls, such as for scan sensitivity, probe labeling and cDNA/cRNA 
quantitation, can be included in the hybridization experiments. In many embodiments, the 
nucleic acid array expression signals are scaled or normalized before being subject to 
further analysis. For instance, the expression signals for each gene can be normalized to 
take into account variations in hybridization intensities when more than one array is used 
under similar test conditions. Signals for individual polynucleotide complex hybridization 
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can also be normalized using the intensities derived from internal normalization controls 
contained on each array. In addition, genes with relatively consistent expression levels 
across the samples can be used to normalize the expression levels of other genes. In one 
embodiment, the expression levels of the genes are normalized across the samples such that 
the mean is zero and the standard deviation is one. In another embodiment, the expression 
data detected by nucleic acid arrays are subject to a variation filter which excludes genes 
showing minimal or insignificant variation across all samples. 

[0027] A variety of peripheral blood samples can be used in the present invention. 

In one embodiment, the peripheral blood samples are whole blood samples. In another 
embodiment, the peripheral blood samples comprise enriched PBMCs. By "enriched," it 
means that the percentage of PBMCs in the sample is higher than that in whole blood. In 
many cases, the PBMC percentage in an enriched sample is at least 1, 2, 3, 4, 5 or more 
times higher than that in whole blood In many other cases, the PBMC percentage in an 
enriched sample is at least 90%, 95%, 98%, 99%, 99.5%, or more. Blood samples 
containing enriched PBMCs can be prepared using any method known in the art, such as 
Ficoll gradients centrifugation or CPTs (cell purification tubes). 

[0028] Peripheral blood samples used in the present can be isolated at any stage of a 

drug treatment (including baseline samples isolated before the drug treatment). For 
instance, the samples can be isolated from patients at 1 day, 2 days, 3 days, 4 days, 5 days, 6 
days, 1 week, 2 weeks, 3 weeks, 4 weeks, 8 weeks, or 16 weeks after initiation of a drug 
treatment. Other time points can also be used for isolating blood samples for monitoring or 
assessing in vivo drug activities. 

[0029] In many embodiments, the patients being treated by a drug therapy of interest 

have a non-blood disease, such as a solid tumor. Solid tumors amenable to the present 
invention include, but are not limited, RCC, prostate cancer, head/neck cancer, ovarian 
cancer, testicular cancer, brain tumor, breast cancer, lung cancer, colon cancer, pancreas 
cancer, stomach cancer, bladder cancer, skin cancer, cervical cancer, uterine cancer, and 
liver cancer. In one embodiment, the solid tumors have the following characteristics: (1) a 
mass of hypeiproliferating cells of clonal origin, and <2) acquisition of an aggressively 
invasive phenotype, where cancer cells leave the tissue of origin and establish new tumor 
metastases at distant sites. In one example, the patients have RCC. 
[0030] Any cancer or disease treatment can be evaluated by the present invention. 

Exemplary cancer treatments include the use of cytokines, such as interferon or interleukin 
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2. In addition, chemotherapy drugs can be used, either individually or in combination with 
other drugs, cytokines or therapies. Suitable chemotherapy drugs include, but are not 
limited to, CCI-779, AN-238, vinblastine, floxuridine, 5-fluorouracil, and tamoxifen. 
AN238 is a cytotoxic agent which has 2-pyrrolinodoxorubicin linked to a somatostatin 
(SST) carrier octapeptide. AN238 can be targeted to SST receptors on the surface of RCC 
tumor cells. Moreover, monoclonal antibodies, antiangiogenesis drugs, and anti-growth 
factor drugs can be employed to treat cancers. j 

[0031] The gene expression profile in peripheral blood samples isolated at one stage 

of a drug treatment can be compared to that at another stage of the drug treatment. Drug 
activity genes that are differentially expressed in PBMCs at one stage of the treatment 
relative to another stage of treatment can therefore be identified. In one embodiment, the 
PBMC expression level of a drug activity gene is substantially higher at one stage than at 
another stage. For instance, an average PBMC expression level of a drug activity gene at 
one stage can be at least 1.5, 2, 3, 4, 5, 10, 20, or more times of that at another stage. In 
another embodiment, the PBMC expression level of a drug activity gene is substantially 
lower at one stage than at another stage. For instance, an average PBMC expression level 
of a drug activity gene at one stage can be no greater than 0.67, 0.5, 0.33, 0.25, 0.1, 0.05, or 
less times of that at another stage. 

[0032] In yet another embodiment, drug activity genes can be identified using 

clustering algorithms based on the nucleic acid array gene expression data. For instance, 
unsupervised cluster analyses can be used to analyze and categorize genes which have 
differential expression patterns between different drug treatment stages. Algorithms for 
unsupervised cluster analysis include, but are not limited to, self-organized maps (SOMs), 
principle component analysis, average linkage clustering, and hierarchical clustering. 
[0033] Supervised cluster analysis can also be used to organize and identify drug 

activity genes. Algorithms for supervised cluster analysis include, but are not limited to, 
nearest neighbors test, support vector machines, and SPLASH. Either two-class or multi- 
class correlation metrics can be used. 

B. Identification of CCI-779 Activity Genes 

[0034] In one embodiment, HG-U95Av2 gene chips (manufactured by Affymetrix) 

were used for detecting and comparing the levels of RNA transcripts in PBMC-enriched 
peripheral blood samples. Peripheral blood samples were isolated from RCC patients at 
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different stages of CCI-779 treatment (See Example 1). The CCI-779 treatment included 
intravenous administration of 25, 75 or 250 mg of CCI-779 once weekly. Peripheral blood 
samples were isolated from RCC patients immediately before the initial administration of 
CCI-779, and then 8 weeks and 16 weeks thereafter. 

[0035] cRNA were prepared from the isolated PBMC samples and hybridized to 

HG-U95Av2 genechips. Hybridization signals were collected for each oligonucleotide 
probe on the genechips. Signals from the oligonucleotide probes of the same qualifier were 
averaged Qualifiers that produce different hybridization signals in samples isolated at 
different treatment time points were identified. Examples of these identified qualifiers are 
illustrated in Table 1 ("the Qualifier Table"). 

[0036] In general, each qualifier in the Qualifier Table corresponds to at least one 

CCI-779 activity gene, and the RNA transcripts of the gene can hybridize under stringent or 
nucleic acid array hybridization conditions to the qualifier. As used herein, "hybridize to a 
qualifier" means to hybridize to at least one oligonucleotide probe of the qualifier. In many 
embodiments, the RNA transcripts of a CCI-779 activity gene can hybridize under stringent 
or nucleic acid array hybridization conditions to at least 2, 4, 6* 8, 10, 12, 14 or 16 
oligonucleotide probes of the corresponding qualifier. 

[0037] Table 2 lists the expression profiles of some qualifiers that produced 

different hybridization signals for PBMC samples isolated at different treatment stages. 
Each expression profile in Table 2 ("Baseline," "8wks Average," or "16wks Average") was 
an average of PBMC samples of 110 RCC patients. Each expression profile under 
"Baseline," "8wks Average," and "16wks Average" represented the hybridization signals on 
the respective qualifier for PBMC samples isolated immediately before the initial CCI-779 
administration, 8 weeks after the initial administration, and 16 weeks after the initial 
administration, respectively. The p-value of an ANOVA analysis was provided for each 
qualifier. The p-value suggests the statistical significance of the difference observed 
between the expression profiles obtained at different CCI-779 treatment stages. Lesser p- 
values indicate more statistical significance for the differences observed between different 
treatment stages. 



Table 2. PBMC Gene Expression Profiles at Different CC I-779 Treatment Stages 
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(n=110) 


16wks 
Average 
(11=110) 


P-Value 


CPS 


aaaa7 o+ 
4Uou /_at 


o 


01 ft 

23. o 


1A A 


^ 1*P.07 




177AA o+ 
3 /2UU__at 


70 1 


7A A 


111 
111 




XI 
j i 


2o3__at 


13. o 


OA ft 
20.0 


14 ft 
34. o 


A ATh 07 


DO 


40 o /_at 


in o 

lU.o 


31 A 

33.U 


5A 1 
D0.1 


7 IT? 07 


^0 


3o4_ju 


19.2 


1A A 
30.U 


51 7 
31./ 


0 0"P-07 


41 


4Uol4_at 


1 o 
19 


12.2 


Q 7A 


0 AP? 07 


47 


3 /U2 /_at 


31. o 


AA A 


G5 A 


0 AT? 07 


A.'X 


l /ou_s_at 


12.1 


7 50 
/.D2 


5 57 


1 1"P AA 


44 


I292_at 


16.3 


3/.4 


39.o 


1 1Th OA 

i . i .ts-uo 


4S 


3o2o7_at 


26.J 


04.9 


70 ft 


1 0"R 0^ 
1 .ZH-UO 


4A 


41591_at 


i a a 
10.9 


A OA 

6.9U 


A 7A 
4. /O 


1 IT? OA 
1 .OH-UO 


47 
4/ 


32475_ at 


>i ao 
4.62 


Q A< 


17 7 
1Z. / 


1 'XU OA 


48 


4U7o2_at 


1A A 

36.4 


23.y 


ft 7Q 

o.zy 


1. AT7 OA 


40 


503_at 


1 A H 

19.7 


33.3 


A1 7 
43 .Z 


1 OA 


50 


o75 g at 


1 io 
132 


Aft 7 
DO./ 


1A ^ 


1 OA 


<1 


371o5_at 


2oo 


1 AA 
16U 


1A A 
34.U 


1 A"R OA 
1 .Oxl-UO 


57 
3Z 


322o2_at 


1 A G 

1U.9 


A 57 
4. J 1 


A 






4U9oi_at 


o in 

y.iu 


A A« 


A 1 n 


1 OP-OA 


54 


34/o2_ at 


7/1 A 

/4.U 


AA 7 
40.2 


7A A 
Z4.D 


1 OP-OA 




111 HI 

3 15 /3_at 


ii n 

n.y 


1 O A 
1V.4 


7A n 


7 0P-0A 

Z.UJCr-wO 


5A 


4UUio_at 


Q 

o 


A A7 
4.0/ 


^ ^7 


7 1P-0A 


57 ' 


3163o_at 


7 in 
/.19 


1 7 Q 

lz.y 


1 a n 


7 IP-OA 

Z. IJl-UO 


58 


3ol9l_at 


\A A 

14.4 


1 1A 
3.14 


j.jO 


.7 AP-0A 


50 


33 3 9o_at 


aa 7 
44.Z 


I'X 7 
/J.Z 


Q1 0 


7 AP_0A 




3olU4_at 


4.J.J 


A 7A 
O. /Q 


5? 7f? 


7 AP-0A 


61 


32i93_at 


7 on 


1 A 1 


74 1 

Z*T. X 


7 Rfc.0A 


62 


3o3jlML_at 


a on 

*h7U 


R Q5 


10 4 


7 RP-06 


63 


i9oD_s_at 


5 57 


Q 1Q 


1 1 Q 

1 1.7 


7 QP.0A 


64 


3ojyo_ai 


in 5 


^ 71 


J.UJ 


J*mJUl \J\J 


65 


41446 f at 

i X TTv 4. 


17.6 


38.9 


44.8 


3.3E-06 


66 


41436_at 


15.8 


9.71 


7.05 


3.3E-06 


67 


36672_at 


7 


17.2 


26 


3.4E-06 


68 . 


38717 at 


6.62 


15.1 


21.0 


3.7E-06 


70 
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Qualifier 


Docalino 

j>asciiJie 
(n=110) 


8wks 
Average 
(n=110) 


16wks 
Average 
(n=110) 


ANOVA 
P-Value 


CPS 


19181 at 


^6 i 


99 6 


14 Q 


3.8E-06 


71 


19717 of 

jZ/d /_ai 


14 0 


96 4 


16 0 

JU.7 


4.3E-06 


72 


4Ui4u_ai 






1 1 Q 

11.7 


4.6E-06 


73 


4f.fi IB Q + 




14 1 


10 


4 7E-06 


74 


1A7Q7 of 
34 /O /__al 


1 S 7 


94 R 


37 2 


4 9E-06 


75 


1C*C^9 of 


7 ns 


IIS 


\ 15 5 


5 0E-06 


76 


/I 1 Q1 O c« of 


^ 76 


in ^ 

lU.J 


19 7 

1 fa. / 


5 1F-06 

*/• XX-» V/vl 


77 


O 1 /TOO -P «4- 

31o22_l_at 


lo.D 


14 0 ■ 


18 1 


6 3F-06 


78 


40o2o_at 


3U, / 


94 1 


14 Q 


6 7F-06 

U. /JZr"V/\J 


79 

/ 7 


3yoy3_at 


3. /O 


q in 
7.iu 


1 1 0 


7 7F-06 

/ . / X-f V/VJ 


80 


4uyy4_at 


19 9 


9 67 


S 90 


7 RF-06 


81 


jyoU2_at 


94 
Z4 


11 6 


5 19 

J.17 . 


9 1E-06 


83 


4Uju /_at 


IQ 1 
J7.1 


6S 4 


Q6 R 


1 0E-05 


84 


<50C10 —x. 

3o0lo_at 


10./ 


11 O 

x 1.7 


6 67 


1 1F-05 


85 


13oo_at 


in a 

i /.y 


111 


4 


1 1F-05 


87 


41653_at 




4 *\7 


1 90 

J.Z7 


1 3F-05 


88 


41ooy_at 


119 


8 41 


S 10 


1 3E-05 


89 


3u4oo_at 


7 C1 


17 7 
l / • / 


25 5 


1.4E-05 


90 


1 AAC of 

iuuD_at 


61 A 


19Q 


166 


L5E-05 


91 


K/fA-Z of 


71 9 


17 3 


26 4 


1.7E-05 


92 


4U j /yjat 


7 A7 


A OA 


^ 43 


1 7E-05 


93 


3ooU /jat 


6 fK. 


11 0 

1 1»7 


16 2 


2 OE-05 


94 


34311_at 


icq 

1 J.O 


94 8 


34 9 


2 1E-05 

fa* X fa/ \J*J 


95 


3 j /oD_at 


Q9 A 
7Z.4 


71 7 

/ x . / 


45 3 


2.1E-05 


96 


/1A.C90 *• of 

4Uozz_r_at 


4S 
4 0 


94 4 
fa*r.*t 


20 9 


2 2E-05 


97 


11777 of 


7 41 


90 9 


31 7 


2 2E-05 


98 


icoon o* 
jo2/u_at 


S 71 


1 1 4 


16.8 


2.5E-05 


99 




6 86 


26 5 


39.9 


2.6E-05 


100 


16Q1 c at 

i ioto s ax 


191 

DAI 


244 


91.2 


2.9E-05 


101 


32612_at 


5.90 


8.86 


12.1 


2.9E-05 


102 


41045_at 


16.7 


33.4 


48.8 


3.0E-05 


103 


37233_at 


19.9 


9.52 


2.86 


3.0E-05 


104 


37967_at 


61.3 


132 


159 


3.1E-05 


105 
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Qualifier 


Baseline 
(11=110) 


8wks 
Average 

fn=1 1 (ft 


16wks 
Average 

\I1 X±.\t) 


ANOVA 
P-Value 


CPS 


2049 s at 


11.9 


26.9 


32.8 


3.3E-05 


106 


39997 at 


24.3 


38.5 


49.0 


3.4E-05 


107 


41332 at 


8.14 


13.7 


18.1 


3.4E-05 


108 j 


115 at 


51.5 


35.0 


9.19 


3.6E-05 


109 


40964 at 


7.62 


3.29 


3.10 


3.7E-05 


110 


35303 at 


25.8 


19.6 


12.0 


4.0E-05 


111 


40742 at 


10 


28.8 


37.4 


4.1E-05 


112 


37955 at 


4 14 

i « X • 


5 81 


8.38 


4.4E-05 


113 


3969R at 


4 10 


6 52 


9.29 


4.5E-05 


114 


JZJOU dt 


6 05 


9 67 


12.4 


4.5E-05 


115 - 


401 50 r at 


10 5 


20 9 


46.6 


5.0E-05 


116 


32635 at 


8 24 


3.95 


3.57 


5.0E-05 


117 


35840 at 


5 05 


7,86 


11.8 


5.1E-05 


118 


37661 at ' 


21.2 


13.3 


6.48 


5.1E-05 


119 


411 i at 


29.2 


40.1 


65.6 


5.1E-05 


120 


35012 at 


20.4 


40.3 


74.9 


5.2E-05 


121 


3R760 f at 


9.33 


16.0 


21.4 


5.3E-05 


122 


36474 at 


9 48 


578 r 


4.48 


53E-05 


123 


41440 at 


4.24 


9,38 


9.10 


5.4E-05 


124 


39953 i at 

J77JJ X »t 


23.1 


9.14 


7,71 


5.8E-05 


125 


245 at 


45.8 


62.2 


103 


6.0E-05 


126 


37181 at 


3.67 


5.90 


7.52 


6.0E-05 


127 


39799 at 


31.3 


21.3 


7.90 


6.2E-05 


128 


39320 at 


6.48 


12.4 


16.1 


6.4E-05 


129 


794 at 


5.29 


10.2 


14.0 


6.4E-05 


130 


36091 at 


7.33 


11.7 


18.7 


6.6E-05 


131 


37187 at 


66.8 


45.1 


11.7 


7.2E-05 


133 


38338 at 


8.76 


14.2 


18.3 


7.2E-05 


134 


34760 at 


14.3 


23.0 


31.0 


7.3E-05 


135 


33133_at 


7.67 


12.3 


16.3 


7.6E-05 


136 


32904_at 


10.2 


19.4 


26.2 


7.7E-05 


137 


37359_at 


7.71 


12.4 


16.3 


8.2E-05 


138 


40485_at 


3.95 


7.81 


9.81 


8.3E-05 


139 
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Qualifier 


X> a scllllc 

(n=110) 


8wks 
Average 
(n—iiv) 


16wks 
Average 
in— nuj 


A NOVA 
P-Value 


CPS 


37215 at 


6 86 


11 3 

1 1 . J 


104 


8 4E-05 


140 


40274 at 


9 10 

J'. 1 V 


21 

z 1 


16 1 


8 5E-05 


141 

X "T X 


41 174 at 


1? 0 


0 OS 


5 71 
/ 1 


8 8E-05 


143 

X "TJ 


1715 At 
1 / l«7__al 


5 86 


14 0 


91 5 
z 1 . J 


8 RF-05 


144 

• X 1 1 T 


41503 at 


10 1 

IVi 1 


7 71 
/ . / 1 


4 00 


8 OE-05 


145 


40300 at 


5 71 

j. / 1 


3 62 


/ 2 81 

J Zr.O 1 


9 2E-05 


146 


40811 at 


13 8 


10 5 


6 81 

VI. 0 1 


9 3E-05 


147 

X"T / 


^1400 c q+ 


1 1 7 

li./ 


74 6 


41 3 

*Tl mJ 


9 3F-05 


148 
1 to 


0AH70 at 


4 76 
t. /O 


7 H 


110 
1 x.y 


0 5F-05 


140 
iny 


117^ o a+ 

1 1 zo_j5_at 


ft 57 


£ 70 


, 4 74 - 


-0 7P-05 


1 50 


0«1 7 o+ 


1 < A 


04 7 


ztO 7 


0 OP 05 


151 
lJl 


joj^ /_at 


1^7 


7 48 


5 05 


0 00010 


159 

1 JZ 


0£1£1 at 


5 74 


7 05 


10 8 


0 00010 


153 


1400 rr at 


77 n 

zz.u 


1 c 

1 J 


8 69 
o.uz 


0 00010 


154 


OQfMl at 


64 8 


06 1 


134 


0 0001 1 

V/.VJv/Ul 1 


155 


0701 1 at 


£7 7 


04 7 


110 


0 00019 


156 


07nno at 


1 6 1 
1D.J 


75 0 


41 7 
*ti • / 


0 0001 3 


157 
i*j / 


O O 7 *+Z__5_jal 


17 0 


Q 57 


7 86 


OD0013 


158 

X «^ 0 




1 67 


Q 86 


10 0 


0 00013 


150 


59Q nt 


10 8 
1 u.o 


93 1 

Z-J. 1 


11 5 


0 00013 


160 

X vy 




8 81 
0.0 X 


9 00 


9 05 


0 00014 


161 


\A r )f, at 


50 8 


45 1 


11 5 

Z / . J 


0 00014 


162 


j i oj *__ai 


1 7 


19 5 


8 10 

O.lv 


0 00014 


163 

1 UJ 




1 94 

IZt 


40 4 


96 3 


0 00014 

V/.V/vV/ It 


164 

X V/t 


36171 r nt 


6 00 


9 52 

Z..JZ 


3 81 


0 00016 

\J ,\J\J\J X Vi 


165 


35975 nt 




9 20 

Z.Z7 


2 81 

z.o 1 


0 00017 


166 


41 1 84 at 

til 0*t a aX 


7 33 


14 8 


18 0 


0 00018 

V/ • \J \J\J X VJ 


168 


38301 at 


16 5 


23 0 


33 3 


0 00018 


170 

X M \M 


30820 at 


28 0 


17 8 


13.6 


0 00019 


171 


37587_at 


6.71 


3.19 


2.90 


0.00020 


172 


34022_at 


35.0 


20.0 


3.52 


0.00020 


173 


34188_at 


13.7 


9.62 


6.05 


0.00022 


175 


37647_at 


13.2 


25.3 


35.2 


0.00022 


176 
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Qualifier 


i>aseuiie 
(n-110) 


8wks 
Average 
(n=110) 


16wks 
Average 
(n— 11U) 


ANOVA 

AllV/VA 

P-Value 


CPS 


400R9 at 


90 4 


19 0 
1Z.VJ 


0 94 


0 00023 


177 


3Q0R9 r at 
J770Z r^di 


7 59 
/. JZ 


A 7A 
4. /0 


9 OS 


0 00094 


17R 


39909 at 

jZZUZ at 


3 14 
J.l*t 


S S7 
D.J / 


R 4R 

O.HO 


0 00096 


179 


J JUJO_aX 


7 33 


0 71 
Z. / I 


9 ft1 
Z.51 


n nno97 


1 R0 
1 ou 


OH A at 
7UO_aX 


1A A 
J4.0 


01 A 
Zj.O 


1 9 A 
1Z.0 


0 00097 


1R1 

l O X 


3A033 of 
JOUjJ_aX 


1Z.O 


ft 3ft 
5. JO 


A 10 
O.IU 


0 0009 ft 


1R9 

lOZ 


3A7AA of 
JO /00__al 


on ft 
zu.o 


3A 9 
JO.Z 


57 9 
J / .Z 


0 00090 


1R3 

i O J 


IftftOI of 

jooyj_ax 


1ft 7 
JO./ 


Aft A 
45.4 


7ft A 
/5.4 


0 00090 


1 R4 

1 Of 


1*707/1 of 

j / d /4_ax 


1 QA 
J.oO 


A 1ft 
O.JO 


o 
o 


n nnn3n 

U.UUUjU 


1 R5 


- 3ASOO of 

jojy7__ax 


4.J 1 


0. /o 


■ * 0 3ft 

y.jo 


0 00030 
U.UUUJU 


1R£ 
1 oo 


AC A o o-f 

4o4_s_ax 


7 A1 

/.4j 


10 7 
lz. / 


IAS 
10. J 


n 00030 

U.UUUJU 


1 R7 
10 / 


jzjjj_j3_ax 


a ns 

O.UJ 


13 1 
1 J.l 


17 0 

i / .u 


0 00031 
U.UUUJl 


lOO 


1AAOA of 

j04tu_ju 


7 AO 
/.OZ 


i 1 
1 1 


ICQ 

1 D.y 


0 00031 
U.UUUJ 1 


1 RO 
i oy 


iaoqo of - 
jozoU_ ax 


7 AO 
/•OZ 


1 1 i 
1 J 


1 ft A 
15.4 


O 00033 
U.UUUJj 


i on 


IftlOA of 

joizo_ax 


1 A 9 
10.Z 


ft ftA 
5.50 


7 71 

/. / 1 


0 00033 

U.UUUJ J 


101 
iy l 


1701 1 of 
j /Ui 1_ ax 


OO A 
ZZ.4 


3ft 0 

- JO.7 


S9 9 
jZ.Z 


0 0003 S 
U.UUUJJ 


109 
iyz 


1Q1AA o of 

jyioo_s_ax 


Q OA 
7.Z4 


0 10 
Z.1U 


9 3ft 

Z. JO 


0 00037 
U.UUUJ / 


103 
iyo 


1 QOS of 

i oz j_ax 


1 A A 
10.0 


Oft 0 

Z5.U 


35 9 
JJ.Z 


0 00040 
U.UUUHU 


1 05 

iy d 


A1 S77 of 

41 j / /_ax 


11 * 


13 7 


4 3R 


0 00041 
U.UUUH1 


1 QA 
lyo 


331A1 of 

j j ioi_ax 


3 14 
0. 14 


A 10 
O.IU 


Q ft1 


0 00049 
u.uuutz 


107 


31A3ft o of 

ji4jo__s__ax 


1 1 s 

1 l.J 


14 7 


93 4 

ZJ.H 


0 00043 

U.UUUHJ 


1QR 
iyo 


3 A3 70 at 


5 OS 
J,UJ 


1 9 A 
iz.o 


10 7 


0 00043 


199 


ooqa c of 

ZU74_JS_ai 


5n 3 


104 

lU*f 


141 • 


0 00045 


200 


! 39077 at 


1 1 A 


99 6 

ZZ.D 


33 4 
j j.*t 


0 0004R 


201 


399A4 at 
jZZOt^dl 


0 90 
y .Z7 


1 R 1 

lO.J. 


90 9 


0 0004R 


202 


•20 1 ftA «f 


q on 


1 A 4 


90 0 

ZV/.v 


0 0004R 


203 


OCiC74 of 

j jo /4__ax 


4 59 


7 A7 
/.u / 


19 

1Z 


0 00049 


204 


A19AQ at 


5 43 
J.HJ 


ft 00 
0.7U 


14 3 

1*T»J 


0 00051 


205 


%fnfn at 


R 33 

O.J J 


3 05 


4 43 

*T.*TJ 


0 00052 


206 


37146 at 


13.3 


5.24 


4.90 


0.00055 


207 


867_s_at 


13.7 


7.24 


3.57 


0.00055 


208 


33500J_at 


53.9 


32.3 


26.4 


0.00056 


209 


1575_at 


7.71 


5.67 


3.57 


0.00056 


210 
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Qualifier 


Ttacplina 

JLJaaCllIlC 

(n=110) 


8wks 
Average 
(n=110) 


16wks 
Average 
(n=110) 


A WW A 

P-Value 


CPS 


40088 at 


22 4 


18 7 
Xo. / 


10 9 
xu.z 


0 00060 


211 

Aril 


37742 at 


6 19 


Q 81 


14 5 


0 00067 


213 

X mJ 


35992 at 


13 1 


Q 33 


5 19 


0 00079 
u.uuu /z 


914 

Z. ±*T 


35710 s at 


6 90 


1 0 6 


1 3 0 


0 00073 


916 

Liu 


39119 s at 


19 9 


18 1 

xo.x 


31 3 

1 D X .J 


0 00074 

U.UUVJ /*+ 


917 
zx / 


34857 at 

J*tOJ / al 


14 0 


Q 43 


/ 6 90 

f O.Z7 


0 00078 

U.UUU / O 


918 ' 
zx o 


41 1 08 at 
*rl X2*0 a I 


96 


46 O 
40. U 


67 8 
o/.o 


0 0008O 


910 


^/M76 r «t 
j*tH / U__T_al 


1 1 5 
1 X.J 


8 48 
o.*fo 


3 81 
3. OX 


0 00084 
U.UUU54 


990 
ZZU 


35772_at 


5.62 


2.71 


2.95 


0.00086 


221 


- -34660_at~ *■» 


13.1 


20.3 


28.4 


0.00090 


222 - 


35648_at 


14.4 


11.1 


5.71 


0.00097 


223 


AFFX- 
HUMRGE 
/Ml 0098 3 at 


65.8 


30.3 


34.9 


0.00099 


225 


*W\0A at 

d x jz*f at 


7 A*\ 


3 71 

J. /X 


3 48 


O OOOQO 

u.uuuyy 


776 
ZZO 


1j7J al 


7 1 O 


17 7 
XZ. / 


14 7 
X*h / 


O 001 o 
U.UUXU 


777 
ZZ / 


^9066 o- nt 
OZUOO^^g al 


77 7 
Z /.Z 


77 A 
ZZ.t- 


19 9 
X z.z 


0 0010 

U.UUXU 


998 
ZZO 


1 1 07 c at 
X X U 1 o al 


70 0 
zu.u 


30 8 

J7.0 


60 


0 0010 

U.UUXU 


990 

ZZ7 . 


1 007 c at 
X U7 / o al 


90 0 


91 7 
zx. / 


0 86 


0 001 1 

U.UU1 X 


930 

Z JU 


^5501 at 


A 48 
*t.*to 


10 
xu 


7 86 
/ .oo 


0 001 1 

VJ.UUX X 


931 

Z J X 


^9815 at 
OZOXJ__al 


17 9 
X / .Z 


7 86 
/.oo 


6 33 


0 001 1 

U.UUX X 


939 

ZJZ 


^69^1 at 


90 6 
ZU.O 


15 7 

X3. / 


0 90 

7.^7 


0 001 1 

U.Uul X 


! 933 


aoooo c at 
jOOZj 5 al 


10 8 
XU.O 


Q 76 


5 10 
J. XU 


0 0019 
u.uuxz 


934 
ZJH 


39618 at 
jZOIO al 


6 05 


0 43 


19 3 


0 0013 


935 


7007 c at 
Z U y Z__b__al 


' 0 57 


6 69 
o.oz 


9 67 
z.u / 


0 0014 


936 
zoo 


371 84 at 

J / X O** al 


8 69 
o.oz 


9 67 

Z.O / 


9 81 


0 0014 


937 

Z«7 / 


37120_at 


15.8 


8.71 


3.48 


0.0014 


238 


1237_at 


79.6 


66.5 


32.8 


0.0015 


239 


37310_at 


9.76 


9.38 


3.95 


0.0016 


241 


40541_at 


13 0 

XJ.U 


6.10 


8.90 


0.0016 


242 


31346_at 


4.90 


10.5 


7.10 


0.0017 


244 ! 


37875_at 


11.4 


5.52 


5.67 


0.0017 


245 


38125_at 


8.24 


3.29 


3.19 


0.0018 


246 
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Qualifier 


Baseline 
(n=110) 


8wks 
Average 
(n=110) 


16wks 
Average 
(n=110) 


AiNUVA 

P-Value 


CPS 


ojy_at 


A 

22.0 


iy.3 


O.oO 


A AA1 A 

u.uuiy 




1CTAO a+ 

3!) fyZjBX 


O AC 

o.Oj 


C <7 


1 CO 

3.D / 


a onon 




34y4o_at 


C yfQ 


Q 1 A 
O.l 4 


110 
1 1.2 




9^0 


3395o__at 


11 / 
ll.O 


1 a c 
lO.J 


O/l 7 

24./ 


A AA01 
U.UUZ3 


9^^ 


32975_g_at 


8.00 


y| OA 

4.yo 


/i 00 
4.zy 


a aaoa 

U.UUZ4 


0^A 


38582_at 


£. AC 
6.95 


O AA 

2.90 


1 o/i 
3.24 


A AAO^ 
U.UU2D 


0^^ 


18l_&_at 


1 A 

12 


1 A O 

19.2 


OA >l 

29.4 


a nnoc. 
U.UU2D 


0^A 


37864_s_at 


1 A A 

10.0 


0.29 


1 vll 

3.43 


a AAOO 

U.002o 


0^7 


34704_r_at 


8,71 


4.00 


O/l 

4.24 


a AAOO 

U.UU2o 


O^Q 


32163_f_at 


1 A O 

10.2 


£L 1 A 
O.10 


4.7o 


a AAOO 

U.002y 


0f:o 1 ' 


4ll89_at 


1 A 

14 


1 A 
10 


0.0/ 


A AA1A 

U.UU3U 


O^A 


A 1 /AO _X 

4l698_at 


0.8I 


£ O/I 
0.24 


3.33 


A AA1A 
U.UU3U 


0A1 
ZOl 


40456_at 


1 1 CO 

11.52 


1 A O 

10.2 


c <o 

D.OZ 


A AA11 

U.UU3 1 


OAO 


41227_at 


15.8 


7.52 


IAD 
lO.O 


a nnii 
U.0033 




0^1 aa .~x 

36100_at 


O A A 

24.4 


1 A Q 

14.o 


1 O A 
Iz.U 


A AAI^ 


0^4 


O /" /I 1 1 — „_x 

36411_s_at 


1 Xt A 

16.2 


O 1 A 

o.l9 


7 OA 

/.yu 


A AAIO 
U.UU3 / 


OA^ 


A 1 1 Z"*7 ~*x 

41l67_at 


1 A *7 

10.7 


c 11 

0.33 


c 11 

D.33 


A AA1T 


0£A 
ZOO 


/I 1 0 1 O ^.x 

41819_ at 


4.4o 


A 71 


0 OO 

y.zy 


A AA17 


0^7 


38389_at 


3.7o 


£ AC 

o.yo 


1 A 0 
1U.Z 


A AA1Q 


9£R 
ZOO 


37975_at 


1 y| A 

14.9 


1C 0 


A1 1 
41.1 


A AA^O 


0AQ 

zoy 


39640_at 


1 A *1 

14.7 


1A 
O.10 


7 oc 

/.yD 


A A041 


070 

\ z /u 


«■> OA 1 A —A 

380lO_at 


7.38 


1 OA 

3.90 


3.33 


A AAA1 
U.UU4J 


071 
Ail 


32675_at 


A 11 

9.33 


1 1 A 

13.0 


OA A 

zU.U 


A AAAA 
U.UU44 


070 
Z /Z 


1 f11 _x 

1531_at 


o.S2 


y| <0 
4.DZ 


1 OA 


A (\f\AA 


97^ 
Z / D 


4064o_at 


o.ol 


12. j. 


1 0 il 

iy.4 


A AA^C 


01 A 

Z /*T 


39474_s_at 


y| 71 
4. / I 


0 11 
2.33 


j.Uj 




975. 


344yo__at 


01 
21 


Oft 0 
Zo.U 






976 


3 /OD4_at 


A A*. 




8 Izl 


0 005^ 


977 
z / / 


zinc qi 


04 A 


152 9 
lo.Z 


J.tO 


0 0054 


978 

Xi / O 


38340 at 


16 7 


12.9 


8.10 


0.0056 


279 


1937_at 


31.6 


13.0 


9.62 


0.0059 


280 


41638_at 


13.3 


8.38 


6.10 


0.0059 


281 . 


36138_at 


21 


32.5 


42.5 


0.0060 


282 
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Qualifier 


Baseline 
(n=110) 


8wks 
Average 
(n=110) 


16wks 
Average 
(n=110) 


AJNv/VA 
P-Value 


CPS 


3Z4U /_jt_3* 


13./ 


A II 

6.33 


< 10 


U.UU01 


Zoj 


jol3i_at 


1 O 
1 / 


1A 1 

36.1 


o on 
Z.9U 


U.UU03 


Zo4 


39/ /D_at 


1 1 1 

11.7 


AO O 

22.7 


ia o 
3Z.O 


U.UU04 


00^ 
ZoD 


j l o_at 


r ia 

j. 19 


6.19 


1 A A 
10.0 


U.UU /j 


OCA 
ZoO 


ia/iao o* 

jy4UZ__at 


83.1 


CO O 
O/.Z 


1 O A 

, 1Z.4 

• 


U.UU / / 


0Q7 
Zo / 


36543_at 


7.57 


C AO 

5.62 


/ O AT 

I 2.6 / 


U.UU/9 


OOO 
Zoo 


l/oo_s_at 


7,33 


A AO 

6.62 


1 AO 

3.62 


A AAQ1 

u.uuyi 


OOO 

Zoy 


34702_jF_at 


10.6 


4.67 


1 OA 

3.86 


A AAAyl 

0.0094 


AAA 

Z90 


4095 l_at 


5.71 


O *7X? 

2.76 


o o^r 

2.86 


A AAAO 

0.0098 


oAi 
291 


35313_at 


5.43 


3 


o 10 

2.38 


0.010 


Z9z 


1520_s_at 


105 


1 1 ^ 
112 


16 


A A1 1 
0.011 


OAI 

293 


40l7l_at 


6.05 


8.14 


1 o o 
12.8 


A A1 1 
0.011 


AAC 

Z9D 


32162_r_at 


14.1 


7.86 


6 


A A1 O 

0.012 


O A A 

Z96 


I369_s_at 


365 


292 


i ^o 
163 


A A1 O 

0.012 • 


OAO 

297 


34490__t_at 


5.05 


O 01 

7.71 


1 A O 

10.2 


A A1 O 

0.012 


ode 
z9o 


367l0_at 


8.81 


tic 
13.5 


22.2 


A A1 1 

0.013 


O A A 

300 


37220_at 


8.24 


1 A A 

14.0 


OA A 

20.0 


A A1 1 

0.013 


1 A1 

301 


32003_at 


A AC 

9.05 


5.29 


/I 1Q 

4.3o 


A A1 1 


0.AO 

30Z 


1 AAO n 

1667_s_ at 


\ A A 

14.4 


A C7 

9.57 


/.14 


n on 
U.U13 


iAi 
jUj 


731_tjat 


O Ol 

/. /I 


/I A< ' 

4.UD 


1 OA 


U.U1 0 


j>U4 


39448_r_at 


o 10 

8.38 


4 


A A1 

4.4:5 


0 01 k 




40385_at 


O 

7 


1 1 A 
1 l.U 


O AT 
Z.O / 


U.Ul / 


j\J0 


*X1A(\1 of 




oo. / 


JQ.J 


u.UZO 


JU / 


41046_s_at 


5.52 


2.57 


3.19 


0.040 


308 


37456_at 


16 


3o;o 


37.6 


0.039 


309 


31828_r_at 


9.52 


2.52 


6 


0.047 


310 



[0038] Each qualifier has a corresponsding CCI-779 activity gene probe sequence 

(CPS), and each CPS can be derived from a corresponding SEQ ID NO depicted in Table 3. 
In many cases, each CPS consists of an unambiguous fragment, or the complement thereof, 
of the corresponding SEQ ID NO. In many other cases, each CPS also comprises at least 
one oligonucleotide probe of the corresponding qualifier. Each SEQ ID NO in Table 3 is a 
cDNA or genomic sequence, or the complement thereof, of a CCI-779 activity gene 
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represented by the qualifier that corresponds to the SEQ ID NO. Accordingly, each SEQ ID 
NO or its corresponding CPS can hybridize 'under stringent or nucleic acid array 
hybridization conditions to the RNA transcripts, or the complements thereof, of the 
represented CCI-779 activity gene. 

[0039] Each SE Q 1° N0 ma y have m Entrez nucleotide sequence database 

accession number (see Table 4). The Entrez nucleotide sequence database is maintained by 
the National Center of Biotechnology Information (NCBI), National Library of Medicine, 
Washington, DC. The database collects sequences from several sources, including 
GenBank, RefSeq, and PDB. 

[0040] Any ambiguous residue ("n") in a SEQ ID NO can be determined by 

numerous methods. In one embodiment, the ambiguous residues in a SEQ ID NO are 
determined by aligning the SEQ ID NO to a corresponding genomic sequence obtained 
from a human genome sequence database. In another embodiment, the ambiguous residues 
in a SEQ ID NO are determined based on the sequence of the corresponding Entrez 
accession number. In yet another embodiment, the ambiguous residues are determined by 
re-sequencing the SEQ ID NO. In general, oach "n" position in a SEQ ID NO represents at 
least one nucleotide selected from a, t, g, and c, or contains no nucleotide residue. 



-Table-3^ualifiersand-CoiTesponding-CPSs-and-SEO-ro^NOs 



Qualifier 


CPS 


SEQ ID NO 


34768_at 


1 


nucleotides 1777 to 2353 of SEQ ID NO: 1 


36097_at 


2 


nucleotides 1273 to 1774 of SEQ ID NO: 2 


36675_r_at 


3 


nucleotides 338 to 744 of SEQ ID NO: 3 


37940_f>t 


4 


the complement of nucleotides 305 to 502 of SEQ ID 

NO: 4 


34338_at 


5 


nucleotides 659 to 995 of SEQ ID NO: 5 


38991_at 


6 


the complement of nucleotides 382 to 527 of SEQ ID 

NO: 6 


38976_at 


7 


nucleotides 1107 to 1517 of SEQ ID NO: 7 


37905_r_at 


8 


nucleotides 1451 to 1961 of SEQ ID NO: 8 


38780_at 


9 


nucleotides 567 to 1095 of SEQ ID NO: 9 


41215_s_at 


10 


nucleotides 719 to 1049 of SEQ ID NO: 10 


35956_s_at 


11 


nucleotides 1360 to 1973 of SEQ ID NO: 1 1 


32332_at 


13 


nucleotides 1 190 to 1720 of SEQ ID NO: 12 


41551_at 


14 


the complement of nucleotides 223 to 388 of SEQ ID 
NO: 13 
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Qualifier 


CPS 


SEQ ID NO 


39332__at 


15 


nucleotides 1091 to 1182 of SEQ ID NO: 14 


4U/ /o__at 


16 


nucleotides 498 to 938 of SEQ ID NO: 15 


3zz3z_at 


17 


nucleotides 484 to 963 of SEQ ID NO: 16 


34310_at 


18 


SEQ ID NO: 17 


1 1 Oil 

1184_at 


19 


nucleotides 1 97 to 743 of SEQ ID NO: 18 


35688_g_at 


20 


SEQ ID NO: 19 


38981_at 


21 


/SEQ ID NO: 20 


37391_at 


23 


nucleotides '1022- 1395 of SEQ ID NO: 21 


39593_at 


24 


uic complement oi nucieoiiQes xh 10 xyy 01 djc\^ id jnlj. 

22 


32505_at 


25 


nucleotides 4 to 223 of SEQ ID NO: 23 


AFFX- ..... 
M27830 5 at 


"""""26 -> 


SEOIDNO-24 


35666_at 


27 


nucleotides 3050 to 3530 of SEQ ID NO: 25 


3906 l_at 


28 


nucleotides 425 to 948 of SEQ ID NO: 26 


39748_at 


29 


nucleotides 2675 to 3052 of SEQ ID NO: 27 


40505 at 


30 


the complement of nucleotides 52 to 286 of SEQ ID NO: 

28 


38597_fat 


31 


nucleotides 2196 to 2257 of SEQ ID NO: 29 


39545_at 


32 


SEQ ID NO: 30 


- — 34268-at 


— 33- - 


nucleotides 1088 to 1530 of SEQ ID NO: 31 


388l6_at 


34 


nucleotides 3244 to 3649 of SEQ ID NO: 32 


38732_at 


35 


nucleotides 762-1229 of SEQ ID NO: 33 


40607_at 


36 


nucleotides 5022-5382 of SEQ ID NO: 34 


37200_at 


37 


nucleotides 1406-1932 of SEQ ID NO: 35 


283_at 


| 38 


nucleotides 1426-1965 of SEQ ID NO: 36 


4075 7_at 


39 


nucleotides 393-818 of SEQ ID NO: 37 


384_at 


41 


SEQ ID NO: 38 


40814_at 


42 


nucleotides 1 123-1247 of SEQ ID NO: 39 


37027_at 


43 


SEQ ID NO: 40 


1760_s_at 


44 


nucleotides 2049-2393 of SEQ ID NO: 41 


1292_at 


45 


nucleotides 1 126-1654 of SEQ ID NO: 42 


38287_at 


46 


the complement of nucleotides 8-451 of SEQ ID NO: 43 


41591_at 


47 


the complement of nucleotides 39-547 of SEQ ID NO: 

44 


32475_at 


48 


nucleotides 1845-2329 of SEQ ID NO: 45 


40782 at 


49 


nucleotides 886-1289 of SEQ ID NO: 46 


503_at 


50 


nucleotides 13-342 of SEQ ID NO: 47 
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i^uauner 




MLVJ JNU 


on c n a t 


M 


nucleotides dd^-ooo oi ocsKI iu jnu. ho 


37185_at 


52 


nucleotides 1311-1761 of SEQ ID NO: 49 


32282_at 


53 


nucleotides 2851-3265 of SEQ ID NO: 50 


4098 l_at 


54 


nucleotides 2661-3257 of SEQ ID NO: 5 1 


34782_at 


55 


the complement of nucleotides 529-952 of SEQ ID NO: 

52 


37373_at 


56 


nucleotides 1339-1771 of SEQ ID NO: 53 


4001 8_at 


57 


nucleotides 5780 to 6213 of SEQ ID NO: 54 


31638_at 


58 


SEQ ID NO: 55 


38191_at 


59 


the complement of nucleotides 171-451 of SEQ ID NO: 

56 


33396_at 


60 


dJqQ id jnu: !>/ 


38104_at 


61 


CCA TT\ XTf>. <0 

oJtiQ NUI jo 


32193_at 


62 


SEQ ID NO: 59 


38350_f_at 


63 


SEQ ID NO: 60 


1985 S at 


04 . 


nucleotides yuoiu oi ony id jnuj. oi 


38598_at 


65 


the cnmnlement of tmcleotides 149-213 of SEO ID NO: 

ll.lv vV/lllL/lwJLLXwlll. 11U vl v V UUvO X ~ ^ 4*±+J vx hJ.i_^v^ ai/ j. ^ 

62 


41446_fat 


66 


the complement of nucleotides 28-303 of SEQ ID NO: 

63 


A 1 A ^ /T _a 

41436_at 


67 


nucleotides 4oo /-4yu / oi dei^ iu jnu. 04 


36672_at 


68 


nucleotides i!>uy-iy / / 01 iu jnu. 00 


3871 7_at 


70 


nucleotides l J to 1/3/ 01 orsv2 ^ J^u. 00 


32183_at 


71 


nucleotides ziyp to zodi 01 any il; jnu. 0 / 


32737_at 


72 


nucleotides loi to ooy 01 oiivj iu jnu. 00 


40140_at 


73 


nucleotides 2o2o to J3do 01 oiiv^ lu inu. oy 


4081 8_at 


74 


nucleotides MjoOJou oi 2>rA^ jlu jnu. /u 


34787_at 


75 


nucleotides j^-zoooio 01 11/ inu. /1 


38652_at 


7o 


nucleotides ozy-i jjj oi ohi^ ll^ in^j. 


41812_s_at 


77 


nucleotides 3618-4192 of SEQ ID NO: 73 


31622_f_at 


78 


SEQ ID NO: 74 


40828_at 


79 


nucleotides 4472-4977 of SEQ ID NO: 75 


39693_at 


80 


the complement of nucleotides 55-554 of SEQ ID NO: 

76 


40994 at 


81 


nucleotides 2004 to 2502 of SEQ ID NO: 77 


39802_at 


83 


nucleotides 444 to 991 of SEQ ID NO: 78 


40567_at 


84 


nucleotides 3868 to 3916 of SEQ ID NO: 79. 


3851 8_at 


85 


nucleotides 3841-4076 of SEQ ID NO: 80 
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GIT ft "m lVffc 


1 3£8 at 


CO 
O / 


nucleotides 4459-4ooj oi oeki id inu. oi 


A1 at 


QO 

oo 


nnolarv+^on i i Al OC + rt 1 1 A7A1 r\f CThO TO X!Y"V 80 

nucleotides 1 10125 to 1 lu/ui oi onv^ jx/ in\j. oz 


/tl ££Q of 

4iooy_ai 


on 

oy 


nucleotides 4596 to 5U4y oi oiiv^ iu inu. 


1£/188 at 

304oo__ai 


on 


tinnlnM^an A ACQ +/-\ C/1/10 r>-T CCA TO MO* 8A 

nucleotides 4959 tO j44y OI olzKl ID INU. oh 


i r\C\C of 

1 uio_ai 


01 


nucleotides /o4 to 13 lo. oi or\i id inu. od 


lAiiO^. at 

304yD__ai 


OO 

y2 


qpa TO "KTO* 8£ 
ollv«J ID INU. oO 


4UD /y_at 


93 


| OXiV^ ID INU. 0/ 


3660 /_at » 


C\A 

94 


Hie complement oi 201545 to z0zi3i 01 olzki iu jnu. oo 


O/IOI 1 «4- 

34311_at 


95 


HCA TOO /n-TCTJA TTl \Tn« CO 

nucleotides 360 to ill oi oiiv yd inu. 


35785_at 


96 


nucleotides 160 to 436 ol oxiv2 id inu. yu 


40622_r_at 


97 


pTJA 1T\ TVT/^V. A1 

bhKl ID INU! 91 


33777_at 


98 


CC/"\ TT\ XT A. AO 

oxiQ id inu: y2 


38220_at 


99 


««AU/v^/J/»n 1 Q A/1 4.— A ICO A f CUn TTl XTW OI 

nucleotides 3894 to 435 / oi id inu. yo 


lo65_s_at 


100 


CUA TTl XT/*^« A/1 

olil^ ID JNU. 94 


1 /CA1 m *»4. 

1693_s_at 


1 A1 

101 


CT3A TTl XTO« QC 

oilKl ID JNU. yD 


32612_at 


1 no 
102 


ni/^la^+i/lor OA11 O/l AC /vf OThO TTl XJO« Q/\ 

nucleotides 2033 to z4io oi oi^v^ id inu. yo 


4l045_at 


103 


-«*U**J 1 CA/Z 1 AOI -4? CT7 A TTl XTA. AO 

nucleotides 1546 to 1973 oi bhQ id INU. y / 


3723 3_at 


1 A A 

104 


CtJA TTl XTft« OC 

bJbv^ id inu: yo 


37967_at 


1 AC 

105 


«„«1«A+^ Qn OOO /CI C rt «TQT3PI TTl XTO» OO 

nucleotides 2 / 8 to 635 oi oni^ id inu. yy 


OA4n _ ^.X 

2049_s_at 


106 


O/IO 1 AA1 n-PCPA TTl XTO» 1 AA 

nucleotides o4z to 1443 01 oxiv^ id inu. iuu 


o r\ An*7 — x 

39997_at 


1 AT 

107 


CtJA TTl XTO- 1A1 
bliv£ 1U INU. 1U1 


41332_at 


1 AO 

108 


^n^loAfi'rlQo £K i rt 1 1 <o A fQCA TTl XJO» 1 AO 

nucleotides 635 to 1 loy 01 piivi inu. iuz 


IIP _A 

115_at 


1 AA 
109 


1AQ1 ozTAC r\-f QThO TTl XTO* 1 Al 

nucleotides 3Uo3-3ouo oi oriv^ yd inu. ivd 


40964_at 


1 1 A 
1 10 


QTJO TTl XTO» 1 AA 
o.Cv£ ID INU. 1U4 


35303_at 


111 
111 


cpA TTl XTfV 1 A^ 
O-Di^ ID INU. 1U3 


40742_at 


1 1 o 
112 


Munlonti'/loP 1 *r\ 1 01 0 rvF QT?0 TTl XIO* 1 C\f\ 

. nucleoli aes 1001 10 iyiz oi ony inv^. iuo 


37955_at 


1 13 


nucleotides coy to /ju oi ojca^ ±d rnxj, iu/ 


3969o_at 


11/1 
114 


r^nolDAti^oc COO frt 1 AA8 Af Q/RO TTl XTO* 1 08 

nucieouucs oyy 10 iuoo oi duv^ wj w\j» iuo 


32586_at 


115 


niinUAti'^Dc CA^Q ccoc n f cpo TTl MO- 100 

nucleotides Duoy xo ddod oi oily 11^ iNvy. ivy 


40159_r_at- 


116 


nnpionf^Ao Q70 1 Af wo rn MO* 110 
nucieouoes y /u-i ^41 oi dev^ jlu 


32635_at 


11/ 


nnplpAfl'/lAC ^OilO tA ^4.94 Af ^IPO TO XJO' 111 

nucieouoes oz4u xo jhzh 01 ojdvc aj-/ i><vy. ± i i 


1CO/IA 0 + 

35o40_at 


1 1Q 

1 lo 


mir>1pnt^PO ^^Q tA 1 1 K\ Af 9FO TO "MO* 119 

nucieouQes dj" 10 1 ioj ui ocy n-/ i>ivy. i iz 


17661 at 


119 

i xy 


nucleotides 4061-4398 of SEO ID NO: 113 


411_i_at 


120 


nucleotides 281 to 318 of SEQ ID NO: 114 


35012_at 


121 


nucleotides 1070 to 1667 of SEQ ID NO: 115 


38760_f_at 


122 


nucleotides 1014 to 1263 of SEQ ID NO: 116 
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Qualilier 


Crb 




36474_at 


123 


nucleotides 3881 to 4U^o 01 disq ID NO: 11 / 


A 1 A A A 

41440_at 


124 


nucleotides 655 to 933 01 oriv^ ID NO: Ho 


39953_i_at 


125 


nucleotides 5549 to 5598 01 biiQ ID NO: 1 19 


245_at 


126 


nucleotides 1795 to 2323 ot bliQ ID NO: 120 


37l8l_at 


127 


nucleotides 375 to 947 ot Sfc,Q ID NO: 121 


39799_at 


128 


SEQ ID NO: 122 


39320_at 


129 


i j c re\ x^ 1 101 ^-T CUA TT"\ XT/"\. 1 

nucleotides 569 to 1 121 or bbQ ID NO: 123 


794_at 


130 


nucleotides 1484 to 2024 of SEQ ID NO: 124 


36091_at 


131 


nucleotides 1806 to 1973 of SEQ ID NO: 125 


37187_at 


133 


nucleotides 504-946 of SEQ ID NO: 126 


38338 at 


134 


the complement of nucleotides 61 to 470 of SEQ ID NO: 

- 127 


34760_at 


135 


nucleotides 3404 to 3694 of SEQ ID NO: 128 . 


33133_at 


136 


SEQ ID NO: 129 


32904_at 


137 


SEQ ID NO: 130 


37359_at 


138 


nucleotides 798 to 1300 of SEQ ID NO: I3l 


40485 at 


139 


the complement of nucleotides 35-188 of SEQ ID NO: 

132 


3721 5_at 


140 


SEQ ID NO: 133 


40274_at 


141 


1 i • J r /* 1 TO _ o OTA 11 \ XTA. 1 O A 

nucleotides 561-736 of SEQ ID NO: 134 


41174_at 


143 


nucleotides 3461 to 4011 oi SEQ ID NO: 135 


1715_at 


144 


nucleotides 1162 to 1402 of SEQ ID NO: 136 


41503_at 


145 


nucleotides 3580 to 4066 of SEQ ID NO: 137 


40390_at 


146 


nucleotides 978 to 1393 of SEQ ID NO: 138 


4081 l_at 


147 


nucleotides 4475 to 4934 of SEQ ID NO: 139 


31499_s_at 


148 


nucleotides 251-854 ot SEQ ID NO: 140 


34023_at 


149 


nucleotides 570 to 1026 of SEQ ID NO: 141 


1125_s_at 


150 


SEQ ID NO: 142 


36617_at 


151 


nucleotides 691 to 819 of SEQ ID NO: 143 


36337 at 


152 


the complement of nucleotides 54-372 of SEQ ID NO: 

144 


36l6l_at 


153 


nucleotides 5172 to 5682 ot sEQ id no. 143 


1433 g at 


154 


nucleotides 1907 to 2267 of SEQ ID NO: 146 


39043_at 


155 


nucleotides 967 to 1366 of SEQ ID NO: 147 


373ll_at 


156 


SEQ ID NO: 148 


37009_at 


157 


the complement of nucleotides 3527-3999 of SEQ ID 
NO: 149 


33942_s_at 


158 


nucleotides 3457 to 3843 of SEQ ID NO: 150 
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Quahfier 


CPS 




39641_at 


159 


nucleotides 759 to 1 m oi oJC/i^j lu inv_/. i ji 


529_at 


160 


nucleotides 1^95 to z3iy oi ohv^; jlu inu. i jz 


32199_at 


I6l 


i Anno xrn^Q /\f QPH m XJY"V 1 

nucleotides 477o to !>Ujy oi dea^ iu inu. idj 


1426_at 


162 


nucleotides 2046 to 2334 oi o&ki iu jnu. 134 


31851_at 


163 


nucleotides 802 to 1201 oi any id inu. idd 


34378_at 


164 


nucleotides 121 /-1314 oi o&ki iu inu. ido 


36173 r at 


165 


nucleotides 4 /Do to 4!OU oi oilki hj in u. i d i 


35275_at 


166 


nucleotides 3D 32 to oy id oi oea^ ii-j inu. i jo 


41184_s_at 


168 


ob\l ID JNU. IDy 


38391_at 


170 


i a* j _ nrn 1 1 Q>l CCA TT\ XT A. 1 /CA 

nucleotides 750 to 1 lo4 oi o&ki iu jnu. lou 


39829_at 


171 


nucleotides 799 to 1337 oi obQ IU JNU. loi 


37587_at 


172 


nucleotides 599 to 1 167 oi bbvi IU JNU. 102 


34022_at 


173 


nucleotides 426-993 of SbQ ID IMU. lo3 


34188_at 


175 


nucleotides 1283 to 1708 oi bbv^ ID JNU. 104 


37647_at 


176 


nucleotides 1702 to 2244 oi d±si^ ID JNU. loo 


40089_at 


177 


i j» j _ 1 i OA x-^ 1 vl ^.-T CCA TT\ XTW 1 

nucleotides 1189 to 1234 oi b«bvj ID JNU. loo 


39982_r_at 


178 


nucleotides 1075 to 1489 oi bbQ UD JNU. lo A 


32202_at 


179 


i . • j i r\r> o o>IAc! «^ CT? A TT*\ XT/^« 1 

nucleotides 1928 to 2496 oi bbC^ JLU JNU. loo 


35036 at 


180 


nucleotides 2895 to 3261 ot bbQ ID JNU. loy 


906_at 


181 


nucleotides 1993 to 2533 oi bhKl ID JNU. l /u 


36033_at 


182 


nucleotides 562 to y lo oi oty iu inu. i / i 


36766_at 


183 


nucleotides I0/-000 01 oc\l iu xnu. 1 il 


38893_at 


184 


nucleotides 29281 to 2^344 01 oxiv^ tu inu. 1 / j 


37374_at 


185 


1 nucleotides i4CO-iy3y 01 ojdv jlu inu. 1 /** 


36599_at 


186 


obVi ID JNU. 1 / J 


464_s_at 


187 


nucleotides 869-lU4o oi oxsi^ id inu. i /o 


32533_s_at 


188 


nucleotides 91 - 4o4 oi obvi id jnu. 1 / / 


36496_at 


189 


nucleotides lUoi-iM4 oi oxiv ^d jnu. i /o 


36280_at 


190 


1 nucleotides 53 j-y^3 oi obv^ id jnu. i /y 


38126_at 


191 


OliV^ ID INU. loU 


3701 l_at 


•4 r\r\ 

192 


nucleoudes 3 / to 4ou oi onv^ -n- 1 inu. ioi 


39166_s_at 


193 


nucleotides 1583-1790 of SEQ ID NO: 182 


1825_at 


195 


nucleotides 6960-7542 of SEQ ID NO: 183 


41577_at 


196 


nucleotides 5804-6213 of SEQ ID NO: 184 


33161_at 


197 


the complement of nucleotides 1 to 270 of SEQ ID NO: 

185 
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TPS 


cfo rn no 


11438 c at 


12*0 


mioi^Ati'H^c 3947 - 3790 of SEO ID NO: 186 


jOj /Z_ai 


1QQ 


nii/*1«»nti'/foc 9437-309Q of SEO ID NO: 187 


OfiO^l 0 at 


900 


nn^i^ti'^c 9713 -39Q4 nf SEO ID NO' 188 


700*7*7 at 


om 

ZUI 


nnpUnti/lae 9074-9906 Af SEO ID NO' 189 


32264_at 


202 


nucleotides 493-755 of SEQ ID NO: 190 


32184_at 


203 


nucleotides 1691-2175 of SEQ ID NO: 191 


35674_at 


204 


nucleotides 3798-4194 of SEQ ID NO: 192 


41249_at 


205 


the complement of nucleotides 63056-63481 of SEQ ID 

NO: 193 


367o2_at 


2Uo 


mipioA+^ac 1 1 oo 1 £30 nf SFO TD NO* 1 Q4 
nucleoli QcS i izy-ioju oi ony \sj i^w. i7t 


3714o_at 


OAT 

20/ 


nucleotides Doy /-ozoz oi oni^ u-j i7j 


— — 867~s_at 


OAO 

2Uo 


tuiAiaAti'/lor 1 C0 1 1 0AS Af TD NO' 1 Qf\ 

nucieotiaes iozi-iy*K) 01 xu 1^0 - 


33500_i_at 


OAA 

2U9 


^nolonfJ/loo 19S3 1 9Q£ nf SFO TD NO- 1 Q7 

nucieotiaes xzoD-xzyo 01 oju\£ ■ll' i^v. xy 1 


1575_at 


O 1 A 

21U 


^noi«/^fiVloc A940 < _4S8S Af SFO TD NO* 1QR 
nucieonaes *r*t*TU-*t»>©.j 01 ony 11^ nu. 170 


40088__at 


01 1 

211 


«iiAlflA*iMoe ££SS 7907 Af SFO TD NO* 1 Q0 
nucieonaes oodd~ / f oi d-dv^ ^xj vwj* 177 


37742__at 


OI *2 

213 


niin1enfi/1po 1 QAf\ 9978 Af SFO TD NO' 200 

nucieonaes iy*+o-zz/o ox ocy ij-/ i^v^. z-w 


35992_at 


01 /i 
214 


™^l^f;/Uc 1 907-1 6S9 Af SFO ID NO' 201 
nucieonaes izu/"iujz ui ocy ij.-* i^w. ^ui 


35710_s_at 


OK 
216 


tMi/xlonfinpo £3 £30 Af SFO TD NO* 909 
nucieonaes dj"Djv oi dih^ xxj iy\j. aua 


391 ly s at 


0 1 1 

21 / 


r»iinl4>AtiH^c 408-797 Af SFO TD NO' 903 
nucieonaes Huo-/Zr/ ui ojla^ xxj x^kj, auj 


34o57_at- — 


OI Q 
21o 


mi^l *»nti a 1 1 9/fiO.1 77Q Af SFO TD NO* 904 
— — nucieonaes~i-zoy-i-/ /y-sjx oj_a^ xxj x^\j* z.u*r 


41198_at 


219 


nucleotides 1589-2106 of SEQ ID NO: 205 


34476_r_at 


220 


nucleotides 4012-4358 of SEQ ID NO: 206 


35772_at 


221 


SEQ ID NO: 207 


34660_at 


222 


the complement of nucleotides 40-615 of SEQ ID NO: 

208 


35648_at 


223 


SEQ ID NO: 209 


AFFX-HUMRGE 
Ml 0098 3 at 


225 


nucleotides 62-1969 of SEQ ID NO: 210 


31324_at 


00£ 

220 


Tni^1f»Atir1f*c SI -308 nf SEO TD NO* 211 


1 one n * 

13y5_at 


00*7 
22/ 


Tmnl^AtiH^c 4^^-10lQ nf SFO TD NO* 212 


32066 g at 


000 
22o 


SFO TD NO* 911 


1107 s at 


229 


nucleotides 7-602 of SEQ ID NO: 214 


1097_s_*t 


230 


SEQ ID NO: 215 


35591_at 


231 


nucleotides 1711-2087 of SEQ ID NO: 216 


32815_at 


232 


the complement of nucleotides 49-265 of SEQ ID MU. 

217 


3623 l_at 


233 


SEQ ID NO: 218 


38823 s at 


234 


the complement of nucleotides 57-557 of SEQ ID NO: 
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CPS 


SEO ID NO 






219 


3961 8 at 


235 


nucleotides 489-909 of SEO ID NO: 220 


7009 c at 




nucleotides 824-1229 of SEO ID NO: 221 


371 84 at 
J / 1 o4_al 


937 


nucleotides 1631-2037 of SEO ID NO: 222 

llllwl wwLlvXwO X wl £>\J U I V/X vja-*v^ * ' w * ! 


37100 of 

o /xzu_aT 


938 
zoo 


nucleotides 1870-2379 of SEO ID NO: 223 


1737 at 


93Q 


nucleotides 658-1204 of SEO ID NO: 224 


37310 nt 


941 

Z*tx 


nucleotides 6668-7083 of SEO ID NO: 225 


/I0^2l1 of 
4U04 l_al 


949 

ZHZ 


nucleotides 1030-1412 of SEO ID NO: 226 

llUvlvUUUvo 1 UJv vi wtvy 


3 1 3,46 af 


ZHt 


nucleotides 647-1187 of SEO ID NO: 227 


17Q7** of 


94^ 


nucleotides 9330-2672 of SEO ID NO: 228 

llllwlwvrlllXww x-Jw»w J~\J 1 vX t*JJ—t\£ i ■ * x-*v« +^+"*~> 


IQIO^ of 


946 
ZHO 


nucleotides 2357-2473 of SEO ID NO: 229 

llUvlGvrllVX&D »/ / <x»~ / •/ wX U-LJN( ■*■ ' ' 




947 


nucleotides 4693-5071 of SEO it) NO: 230 


35/y2_at 


9^Q 


nnrlpntides 884-1 1 74 of SEO ID NO: 231 


34946_at 


250 


nucleotides 386-895 of SEQ ID NO: 232 


33956_at 


253 


nucleotides 90-594 of SEQ ID NO: 233 


32975_ft_at 


254 


nucleotides 18770-18875 of SEQ ID NO: 234 


38582_at 


255 


the complement of nucleotides 40 to 288 of SEQ ID NO: 

ZJD 


lOI rr o-f 

181 g at 


9^6 
ZOO 


nucleotides 1464-1864 of SEO ID NO: 236 


170/C/l C of 


9^7 
ZJ / 


nucleotides 1108-1568 of SEO ID NO: 237 


1/1 70A i* of 


9^8 


SEO ID NO: 238 


171 A3 «P of 


9^0 


SEO ID NO: 239 


/111 CO of 

4iioy__at 


960 
ZOU 


nucleotides 1 161-1608 of SEO ID NO: 240 

llUwlwvTXvXV'O X 1 VX x. www wX wxv\^ xx/ x^^-^» » w 


A 1 AOQ of 

4ioy8_at 


961 
ZOl 


nucleotides 45240-45830 of SEO ID NO: 241 


4U4jO_ai 


969 

ZDZ 


nucleotides 733-1310 of SEO ID NO: 242 


/11777 of 

4izz /_ai 


963 


nucleotides 35752-35804 of SEO ID NO: 243 


361 OO af 

30 luteal 


964 


nucleotides 2613-3118 of SEO ID NO: 244 


3<A1 1 c at 
JD411 S al 


96S 

ZOJ 


nucleotides 1623-2080 of SEO ID NO: 245 


/111 67 of 

4iio/_at 


966 
zoo 


nucleotides 1596-2100 of SEO ID NO: 246 

UUvlvwUUiVD X «/ w **xvw vx *—i ■ ,■ ■*- ' — " 




967 
zu / 


nucleotides 2242-2578 of SEO ID NO: 247 


3C3CQ of 


968 
zuo 


nucleotides 960-1377 of SEO ID NO: 248 


37Q7^ of 


96Q 


nucleotides 3684-4231 of SEQ ID NO: 249 


39640_at 


270 


nucleotides 2415-2938 of SEQ ID NO: 250 


38010_at 


271 


nucleotides 1044-1494 of SEQ ID NO: 251 


32675_at 


272 


nucleotides 972 - 1411 of SEQ ID NO: 252 


1531 at 


273 


nucleotides 1691-1891 of SEQ ID NO: 253 
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Qualifier 


CPS 


bhiVl JJJ INU 


40646_at 


274 


nucleotides 249o-3Uiz oi SJiQ id jnu. zoh 


39474_s_at 


275 ; 


i . • j oo CAO a-TCCA FTl XTfV 

nucleotides 22-507 oi SEQ ID JNU. zjD 


34498_at 


276 


nucleotides 1434-iyo3 oi aisQ id jnu. zdo 


37654_at 


277 


nucleotides 158u-zU4i oi SEQ ID jnu. z:> / 


408_at 


Ann 

278 


nucleotides 1412-loM oi SEQ ID JNU. Zjo 


38340_at 


279 


nucleotides 4103-4434 ol SEQ ID MU. zDy 


1937_at 


280 


CTTTi^V TT\ XT/"\. n£.f\ 

SEQ ID JNU. 2oU 


41638_at 


281 


nucleotides 1696-1922 of SEQ ID JNU: zol 


36138_at 


282 


nucleotides 943-1477 oi SEQ ID JNU: zoz 


32407_f_at 


283 


nucleotides 3-280 ol SEQ ID NU. zoi 


38131_at 


284 


nucleotides 1240-1517 oi SEQ ID JNU. zo4 


39775_at 


285 


nucleotides 18180 -18249 ot SEQ ID JNU. 2o5 


318_at 


286 


nucleotides 601-838 ot SEQ ID JNU: zoo 


39402_at 


287 


nucleotides 927-1473 of SEQ ID INU: 267 


36543_at 


288 


nucleotides 1723-2013 of SEQ ID INU: 2oS 


1788_s_at 


289 


nucleotides 1622-2058 oi SEQ ID JNU: zw 


34702_f_at 


290 


nucleotides 534-575 of SEQ ID JNU: 27U 


4095 l_at 


291 


nucleotides 18oQ-20yy oi SEQ ID jnu. z/i 


35313_at 


292 


nucleotides o4l2-oo2o oi seq id jnu. z /z 


1520_s_at 


293 


OT7/^\ TT\ XT A. 0*71 

SEQ ID jnu: z/j 


40171_at 


295 


OCA TT\ XTO« OO/I 

SEQIDISU:Z/4 


32162_r_at 


296 


i I? >ioa /ion ocn tt\ xta» one 
nucleotides 420-480 ot SEQ ID JNU. I O 


1369_s_at 


297 


nucleotides 3645-4005 oi SEQ ID INU: z/o 


34490_f_at 


298 


SEQ ID NU: 277 


36710_at 


300 


i x«j n a ccn «f OUA T"p\ XT A. T7Q 

nucleotides 74-557 oi SEQ ID INU.vZ /o 


37220 at 


301 


OH? A "m XT A. OOO 

SEQ id jnu: z/y 


32003_at 


302 


nucleotides 1144-1487 of SEQ ID NO: 280 


1667_s_at 


303 


nucleotides 1531-1603 of SEQ ID NO: 281 


731_f_at 


304 


SEQ ID NO: 282 


39448 r at 


305 


the complement of nucleotides 46-344 of SEQ ID NO: 

283 


40385_at 


306 


nucleoli aes zu/-/*tz oi o.ea^ ■ii-' xnv-^. zot 




307 


nucleotides 1184-1653 of SEQ ID NO: 285 


41046 s at 


308 


nucleotides 5551-6046 of SEQ ID NO: 286 


37456_at 


309 


SEQ ID NO: 287 


j 31828_r_at 


310 


nucleotides 1750 to 2274 of SEQ ID NO: 288 
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Table 4. SEP ID Nos and Correspond in g Entrez Accession Nos 



SEO ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 


1 
1 


at nannafi 


Homo sapiens mRNA; cDNA 
TOnJ7n5fi4F1 962 (from clone 
DKFZp564E1962) 


2 


M62831 


Human transcription factor ETR101 
mRNA 


■ 3 


J03191 


/ Human profilin mRNA 


4 


AA806768 






D49738 


Human cytoskeleton associated protein 
(CG22) mRNA 


6 


U55980 




7 


D44497 


Human mRNA for actin binding protein 
p57 


8 


X66436 


H.sapiens hsrl mRNA 


9 


J04794 


Human aldehyde reductase mRNA 


10 


D13891 


Human mRNA for Id-2H 


11 


U18467 


Human pregnancy-specific beta 1- 
glycoprotein 7 (PSG7) mRNA 


12 


X69433 


H.sapiens mRNA for mitochondrial 
isocitrate dehydrogenase (NADP*) 


13 


AW044624 




14 


AF035316 


Homo sapiens clone 23678 mRNA 


15 


AF035555 


Homo sapiens 17beta-hydroxysteroid 
dehydrogenase type 10/short chain L-3- 

hydroxyacyl-CoA dehydrogenase 
(HSD 1 7B 1 0/SCHAD) mRNA; nuclear 
gene for mitochondnal product. j 


16 


AF047181 


Untrin Qanien^ NADH-ubiauinone 
oxidoreductase subunit CI-SGDH mRNA 


18 


D45248 


Human mRNA for proteasome activator 
hPA28 subunit beta 


' 21 


X12451 


Human mRNA for pro-cathepsin L (major 

**Yrrpff*H nrntein M"RP^ 

vAUvlCU piVltwJXl lVXXwX f 


22 


AI432401 




23 


W28652 




24 


M27830 


Human 28S ribosomal RNA gene 


25 


U38276 


Human semaphorin IE family homolog 
mRNA 


26 


D28137 


Human mRNA for BST-2 
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SEQ ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 


27 • 


AL050021 


Homo sapiens mRNA; cDNA 
DKJFZp564D016 (from clone 


28 


AA883502 




29 


D50402 


Human mRNA for NRAMP1 


31 


X91809 


H.sapiens mRNA for GAIP protein. 


32 


AF095791 


Homo sapiens TACC2 protein (TACC2) 
mRNA 


33 


X91788 


H.sapiens mRNA for Icln protein 


jn 




TTrtmn ^aniens N2A3 mRNA 


35 


J04162 


Human leukocyte IgG receptor (Fc- 
gamma-Kj iilkjn a 


36 


L16842 


Human ubiquinol cytochrome-c reductase 

_-._n_M.i_rM 1 ■_««>j^v_-__ t *vi 1/ \ A 

core l protein mKiN a 


37 


Ml 8737 


Human Hanukah factor serine protease 
(nurir ) misJN j\ 


39 


L40586 


Homo sapiens iduronate-2-sulphatase 

(IJJo ) mrUNA 


41 


D11327 


Wnman mT^^TA for "nrntfiin-tVTnsifie 

phosphatase 


42 


LI 1329 


Homo sapiens protein tyrosine 
phosphatase (PAC-1) mRNA 


43 


-AA808961 


— — ^ — - . 


4/1 






45 


AF025529 


Homo sapiens leucocyte immunoglobulin- 
liKe recepior-ou \i^iiv-u ) iiixvi^irv 


46 


AP061741 


Homo sapiens retinal short-chain 
uenyurogenase/reoucxasc rcioL/ivi in.i\_rN/\ 


47 


U37690 


Human RNA polymerase II subunit 
insivjrDiuj mivi>ir\ 


48 


M26683 


Human interferon gamma treatment 

lnuuciDic IHJVLN/\ 


49 


Y00630 


Human mRNA for Arg-Serpin 

^piasnunogcll aOllvdlur-llJiuuliAJi ^ i flat**) 


50 


U66047 


Homo sapiens clone Z3-1 placenta 
expressed mRNA from chromosome X 


51 


U00930 


Human clone C4E 1.63 (CAC)n/(GTG)n 
repeat-containing mRNA 


52 


AL021938 


Homo sapiens DNA sequence from PAC 

OWKd nn rVirnmn<5nme 6n22 3 Contains 

the JUMONJI gene for a hypothetical 
141.7 kD protein. Contains ESTs, STSs, a 
CA repeat polymorphism and genomic 
marker D6S260* 
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SEQ ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 


53 


U27460 


Human undine dipnospnogiucose 
pyrophosphorylase mRNA 1 


54 


AB007870 


Homo sapiens KIAA0410 mRNA 




AI040181 i 




61 


X73066 


H.sapiens NM23-H1 mRNA 


62 


AI679353 


1 


64 


AJ224901 


Homo sapiens mRNA for ZNF198 protein. 


65 


LI 3977 


Human procarboxypeptidase mRNA 


66 


AL050159 


Homo sapiens miviN a, cjluna 
DKFZp586A0522 (from clone 
_ DKFZp586A0522) 


67 


M74002 


Human arginine-rich nuclear protein 
mRNA 


68 


M64595 


Human small G protein (Gx) mRNA 


69 


D76444 


Homo sapiens hkf-1 mRNA 


70 


D14041 


Homo sapiens mRNA for H-2K binding 
factor-2 


71 


■XQ3209 


H.sapiens mRNA for NRD1 convertase 


72 


AF070644 


Homo sapiens clone 24742 mRNA 
sequence 


73 


AB020713 


Homo sapiens mRNA for KIAA0906 
protein 


75 


D63476 


Human mRNA for KIAA0142 gene 


76 


N53547 




77 • 


L15388 


Human G protein-coupled receptor kinase 
(GRK5) mRNA 


78 


X72308 


Homo sapiens mRNA for monocyte 


79 


X01703 


Human gene for alpha-tubulin (b alpha 1) 


80 


Y18004 


Homo sapiens mRNA for SCML2 protein 


81 


M27492 


Human interleukin 1 receptor mRNA 


82 


AL008729 


Human DNA sequence from PAC 251 Al 
on chromosome 6p24 


83 


D83776 


Human mRNA for KIAA0191 gene 


84 


AB011542 


Homo sapiens mRNA for MEGF9 


85 


X68277 


H.sapiens CL 1 00 mRNA for protein 
tyrosine phosphatase 


88 


Z99716 


Human DNA sequence from clone CTA- 
250D10 on chromosome 22 Contains the 
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SEQ ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 






genes for SREBF2 (sterol regulatory 
element binding transcription factor 2), 
^AGA(alpha-N-acetylgalactosaminidase), 
a gene similar to neuronal-specific septin 

J, a pSCUQOgCIIv DlUlllax Wj t\x\ x*. ^auvmuc 

nucleotide translocator 2), 2 mRNAs based 
on ESTs, a genomic marker D22S1 178, a 
CA repeat polymorpism, ESTs and a CpG 
island 


j 89 


X76648 


H.sapiens mRNA for glutaredoxin 


90 


W28281 




93 


U2Uy3o 


Human lymphocyte dihydropyrimidine 
dehydrogenase mRNA 


94 


M63193 


Human platelet-derived endothelial cell 
growth factor mRNA 


96 


X04412 


Human mRNA for plasma gelsolin 


97 


U77643 


Homo sapiens K12 protein precursor 
mRNA 


99 


AF000424 


Homo sapiens LSTl mRNA, cLSTl/C 
splice variant 


100 


M29039 


Human transactivator (jun-B) gene 


102 


D38251 


Homo sapiens mRNA for RPB5 (XAP4) 


103 


_ X14787 


Human mRNA for thrombospondin 


106 


Aifi /rem 

Mlo5yl 


Human hemopoietic cell protein-tyrosine 
kinase (HCK) gene 


107 


AB015631 


Homo sapiens mRNA for type II 
membrane protein 


108 


U51712 




109 


D86971 


Human mRNA for.KIAA02l7 gene 


110 


M55067 


Human 47-kD autosomal chronic 
granulomatous disease protein mRNA 


111 


AB029036 


TTomo saniens mRNA for KIAAl 1 13 
protein 


112 • 


AL050060 


Hnmo saniens mRNAi cDNA 
DKFZp566H073 (from clone 
DKFZp566H073) 


113 


J04027 


Human plasma membrane Ca2 + pumping 
ATPase mRNA 


114 


X57351 


Human 1-8D gene from interferon- 
mducible gene family 


115 


M81750 


H.sapiens myeloid cell nuclear 
differentiation antigen mRNA 


116 


U90546 


Human butyrophilin (BTF4) mRNA 
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^FO TD NO 




Reported Source of Entrez Sequence 


117 


AB018319 


Homo sapiens mRNA for KIAA0776 
protein 


118 


D82061 


Homo sapiens mRNA for a member of the 
snort-cnain aiconoi uenyurogendbe loiiuiy 


119 


AB014528 


Homo sapiens mRNA for KIAA0628 
protein 


120 


M25280 


Human lymph node homing receptor 
mRNA 




yv / U J j O 


TT <;»rnens Mnvl 7 mRNA 


123 


U13697 


Human interleukin 1 -beta converting 1 
enzyme isoiorm oeta (il,id\*e,) iuxxina 


124 


X62055 


H.sapiens PTP1C mRNA for protein- 
tyrosine phosphatase 1C 


"125 . 


AF051323 


Homo sapiens Src-associated adaptor 
pro tern \o/Vro ) mxviN/v 


1 0£ 




Human rvtokine /GRO-beta^ mRNA 


1 07 


Aioni 1 08 

/VLZUx 1UO 




I/O 


1-/ i 


Human mRNA for KIAA0022 eene 


131 


D14658 


Human mRNA for KIAAO 1 02 gene 


132 


AA176780 




1 j4 


T T480 1 % 


Mi i-m an F)-<dte hindm*? TYrotein @ene 

XXlXXXXCXXX i/'Ollv UXXXUXXXg YJl \J gviiv 






Homo sapiens Ran binding protein 2 


- - 1-3 5 . 


AF012086 


(KaniDr/aipnaj mrviN/\ 


136 


U37518 


Human TNF-related apoptosis inducing 
ligand TRAIL mRNA 


137 


AB020661 


Homo sapiens mRNA for KIAA0854 
protein 


118 


T0S037 


Human serine dehydratase mRNA 

_|_ _M_ \M III 1 W^^X 111V %A^rJLX T *AI.|*M*fc/V AAAAW A 


139 


AB011148 


Homo sapiens mRNA for KIAA0576 
pruicin 


140 


X16863 


Human Fc-gamma RH1-1 cDNA for Fc- 
gamma receptor IE- 1 (CD 16) 


141 


X06948 


Human mRNA for high affinity IgE 
receptor aipna-auouixit ^rv/mxij 


143 


X77956 


H.sapiens Idl mRNA 


144 


AI760801 




145 


M34175 


Human beta adaptin mRNA 


140 


uoouiy 


Homo sapiens mad protein homolog 
(hMAD-3) mRNA 


147 


AF006084 


Homo sapiens Arp2/3 protein complex 
subunit p41-Arc (ARC41) mRNA 


149 


AL035079 


Human DNA sequence from clone 53C18 
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SEQ ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 






on chromosome 1 lpl2-13 


150 


AF004563 


Homo sapiens hUNC18b alternatively- 
spliced mKJN a 


151 


X52486 


Human mRNA for uracil-DNA 
glycosylase 


152 


U 15932 


Human dual-specificity protein 
phosphatase mRNA 


153 


U20489 


Human glomerular epithelial protein 1 
(uLbr r 1 ) mKJN A 


154 


D89077 


Homo sapiens mRNA for Src-like adapter 
protein 


155 


AJ224819 


Homo sapiens mRNA for candidate tumor 
suppressor involved in B-CLL 


-156 — 


. * -X97324 


H.sapiens mRNA for adipophilin -~ 


157 


AF002163 


Homo sapiens delta-adaptin mRNA 


158 


AL050025 


Homo sapiens mRNA; cDNA 
DKPZp564D066 (from clone 
DKFZp564D066) 


160 


M94345 


Homo sapiens macrophage capping protein 
mRNA 


161 


AB016811 


Homo saniens mRNA for ADP 
ribosylation factor-like protein 


162 


_ . S43855 ..... 


recoverin Dhotoreceptor protein [human, 

IWvT Villi W*1V W* VW «r |^ » W* L 9 

■ - retinarmRNA] — 


163 


M36821 


Human cytokine (GRO-gamma) mRNA 


164 


AF070606 


Homo sapiens clone 2441 1 mRNA 
sequence 


165 


Muzo4U 


Unmoti cl^>\r^^\^Y\7nr\7^ HvHlTilflQfi nYRNA 


166 


AJ224442 


Homo sapiens mRNA for putative 
methyltransferase 


167 


D13265 


Human mRNA for macrophage scavenger 
receptor type II 


168 


U67322 


Human HBV associated factor (XAP4) 
mRNA 


169 


U94333 


Human Clq/MBL/SPA receptor ClqR(p) 
mRNA 


170 


L78440 


Homo sapiens STAT4 mRNA 


171 


AL049309 


Homo sapiens mRNA; cDN A 
DKFZp564B176 (from clone 
DKFZp564B176) 


172 


X55988 


Human EDN mRNA for eosinophil 
derived neurotoxin 


173 


AL008637 


Human DNA sequence from clone GTA- 
833B7 on chromosome 22ql2.3-13.2 
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SEQIDNO 


Entrez Accession No. 


Reported Source of Entrez Sequence 






Contains the NCF4 gene for cytosolic 
neutrophil factor 4 (40kD), the 5' part of 
inc i^»or Zjvjj gone xur oiiuiut/y ic- 
macrophage low-affinity colony 
Qtimiilflfino- ftr tor 1 recentor beta. ESTs 

Oil All UidlAAAci lav Ivy A X. ivvv|/lUl vvWlj X^vJ X Oj 

STSs and GSSs 


174 


M82809 


Human annexin IV (ANX4) mRNA 


176 


U72882 


TTuman mtpHWon-induced leucine zirjDer 

J. AUAAICUA AA1 Lv<l IVi VA1 J-L1\J.IA\/\*V* IWUX'AAAV/ 4->l^J^-J\sl. 

1 protein (IFP35) mRNA 


177 


AF054825 


Homo sapiens VAMP5 mRNA 


178 


AF014398 


TTrvmn caniPtiQ Tnvn-innQitnl 

A AVjilJU OdLliwllO XlljSJ Xll\Jol\\Jl 

monophosphatase 2 mRNA 


l/y 


Uzol 74 


Human pre-granzyme 3 mRNA 


181 


U49392 


Human allograft iifflarrirnatory factor- 1"^ 
(AIF-1) mRNA 


182 


D83174 


Human mRNA for collagen binding 
protein 2 


183 


L33075. 


1AUAA1U odUiCllo 1 do VJ A 1 CXoC' Civil YClllIA£, AJJVV 

protein (IQGAP1) mRNA 


184 


AB020630 


TTomo QflnipnQ rrTR"NA for K1A A089^ 

XAUillU oCtpiCIlO AAAAVl>.r^. 1\JL XvJLTVrVV/OZ<J 

protein 


185 


AI018098 




-186 


Z22971 


H.sapiens mRNA for Ml 30 antigen 
exfracellular variant 


1 on 




Human nexoianase m ^riivo ) mKiN/v 


188 


K00650 


Human fos proto-oncogene (c-fos) 


189 


U49187 


Human placenta (DifT48) mRNA 


i on 

iyu 


T Oil 1A 


Homo sapiens metase (jylb i -i ) mKJNA 


191 


X61118 


Human TTG-2 mRNA for a cysteine rich 
protein with LEVI motif 


192 


AB023211 


Homo sapiens mRNA for KIAA0994 
protein 


. 193 


AL031282 


Human DNA sequence from clone 283E3 
on chromosome lp36.2 1-36.33. Contains 
the alternatively spliced gene for Matrix 
Metalloproteinase in the Female 
Reproductive tract MIFR1, -2, 
MMP21/22A, -B and -C, a novel gene, the 
alternatively spliced CDC2L2 gene for 
Cell Division Cycle 2-Like 2 (PITSLRE, 

p58/GTA, Galactosyltransferase 
Associated Protein Kinase) beta 1, beta 2- 
1, beta 2-2 and alpha 2-4, a 40S Ribosomal 
Protein S7 pseudogene, part of the 
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SEQ ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 






KIAA0447 gene, a novel alternatively 
spliced gene similar to many 
(archae)bacterial, worm and yeast 
hypothetical genes, and the GNB1 gene for 
Guanine Nucleotide Binding Protein (G 
protein), Beta polypeptide 1 (Transducin 
Beta chain 1). Contains putative CpG 
islands, ESTs, STSs and GSSs 


194 


X15376 


Human mRNA for GABA-A receptor, 
gamma2subunit 


195 


AB007864 


Homo sapiens KIAA0404 mRNA 




TT1 O/ITI 
U 1Z4/1 


Human thrombospondin-1 gene 


197 


S71043 — 


Ig alpha 2 immunoglobulin A heavy chain 
^allotype 2 {constant region, germ line} 
{human, peripheral blood neutrophils, 
Genomic] 


198 


M14758 


Homo sapiens P-glycoprotein (PGY1) 
mRNA 


199 


X84373 


H.sapiens mRNA for nuclear factor 
RIP 140 


200 


lVXj*t*tZrJ 


XxUlIlctll UCla-galaLlUMUobC ^vJJ-dDl J IIJJvTN/A. 


201 


/VTuO I\jD\j 


J1UII1U oapiCXlo III UoCUlXIl IIJLEVlNjrV. 


202 


U95006 


Human D9 splice variant A mRNA 


203 


AA631972 


* 


204 


Z24724 


H.sapiens polyA site DNA 


205 


AF055008 


Homo sapiens clone 24720 epithelin 1 and 

1 niKJNA 


206 




JjLvslxlVJ oO-JJlvXlO JJLJLXVL^^V 1U1 v^xlvgUxxxl • 


208 


AT1 42565 




210 


M10098 


Human 1 8S rRNA gene 


211 


U82303 


Homo sapiens unknown protein mRNA 


212 


L25081 


TTninn sjrnieri^ (t'I Hase fThof 1 ^ mRNA 

llUiliv OCl^/lVXXO VJ XX CWv ylUUvi XJLXXVJl ^ x\ 


214 


Ml 3755 


Human interferon-induced 17-kDa/15-kDa 
protein mRNA 


216 


M13142 


Human factor XI (blood coagulation 
factor) mRNA 


217 


. AI687419 




219 


AI961743 




220 


X93086 


H.sapiens mRNA for biliverdin IX alpha 
reductase 


221 


J04765 


Human osteopontin mRNA 
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SEQ ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 


000 




Homn ^fliSiens svntaxin 1 A mRNA 


223 


X91817 


H.sapiens mRNA for transketolase-like 
protein 


224 


S81914 


IEX-l=radiation-inducible immediate- 
early gene [human, placenta, mRNA] 


225 


X02419 


H.sapiens uPA gene 


226 


X01630 


Human mRNA for argininosuccinate 
/ synthetase 


227 


AJ001481 


Homo sapiens mRNA for DUX1 protein 


008 


TT7Q79S 
U /y IAD 


"Human A^l antifreTi nrecursor mRNA 


229 


Ml 4083 


Human beta-migrating plasminogen 
activator inhibitor 1 mKJN A 


230 


"U03688""~""~ 


Human dioxin-inducible cytochrome P450 
(CYPlBl)mRNA 


231 


U67963 


Human lysophospholipase homolog (HU- 
K5) mRNA 


232 


AJ223183 1 


Homo sapiens mRNA for DORA protein 


233 


AB018549 


Homo sapiens MD-2 mRNA 


234 


U07563 


Human ABL gene, exon lb and intron lb, 
and putative M8604 Met protein (M8604 
Met) gene 


235 


AI96122U 




236 


S82470 


BB1 malignant cell expression-enhanced 
gene/tumor progression-enhanced gene 
[human, UM-UC-9 bladder carcinoma cell 
lme, mKJNAj 


237 


Y14737 


Homo sapiens mRNA for immunoglobulin 
lambda heavy chain 


240 


Y09392 


H.sapiens mRNA for WSL-LR, WSL-S1 
and WSL-S2 proteins 


241 

• 


AL031685 


Human DNA sequence from clone RP5- 
963K23 on chromosome 20ql3.1 1-13.2 
Contains a KRT18 (Keratin type I, 
Cytoskeletal 1 8 (Cytokeratin 1 8, 
CK18,CYK18)) pseudogene, a gene for a 
novel protein, the SPATA2 gene for 
spermatogenesis associated protein 2 
(KIAA0757) and the 3 end of the gene ior 
KIAA0939 (novel Sodium/hydrogen 
exchanger familv member). Contains 
ESTs, STSs, GSSs and four putative CpG 
islands 


242 


AL049963 


Homo sapiens mRNA; cDNA 
DKFZp564A132 (from clone 
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SEQ ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 






DKFZp564A132) 


243 


AL022162 


Homo sapiens DNA sequence from PAC 
454M7 on chromosome Xq25-26.3. 
Contams the OCRL1 gene tor Lowe 
Oculocerebrorenal Syndrome protein 

OCRL-1 . Contauis Jio i s, o i os ana vjoos 


244 


AF022375 


tfomo sapiens vascular endothelial growth 
factor mKJNA 


245 


U29943 


Human ELAV-like neuronal protein-2 
Mel-JNz mKJNA 


246 


M64929 


Human protein phosphatase 2A alpha 
subunit mKJNA 


247 


AF001862 


Homo sapiens FYN binding protein I 
mRNA 


248 


X04371 


Human 1 .6Kb mRNA for 2-5 A synthetase 
induced by interferon 


249 


X04011 


Human mRNA of X-CGD gene involved 
in chronic granulomatous disease located 
on chromosome X 


250 


AB016789 


JtlOmO SdpiCIlo IHIViNi^. 1U1 

Glutamine :fructose-6-phosphate 
amidotransferase 


Zjl 


AF002697 


Homo sapiens E1B 19K/Bcl-2-binding 
protein Nip3 mRNA, nuclear gene 






encoding mitochondrial protein 


252 


UZlo /o 


Human mRNA for BST-1 


253 


U50535 


Human BRCA2 region, mRNA sequence 
CG006 


254 


U20350 


Human G protein-coupled receptor V28 
mRNA 


255 


AF045800 


Homo sapiens gremlin mRNA 




D89974 


1 Homo sapiens mRNA for 
elvcosvlphosphatidyl inositol-anchored 
protein GPI-80 


257 


DDI fO^t 


"Human mRNA for KIAA0064 gene 


258 


X54489 


Human gene for melanoma growth 
stimulatory activity ^jyivjoaj 


259 


AB014555 


Homo sapiens mRNA for KIAA0655 
protein 


260 




Human retinoblastoma associated (RBI) 
mRNA 


261 


D38552 


Human mRNA for KIAA0073 gene 


262 


X04106 


Human mRNA for calcium dependent 
protease (small subunit) 
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SEQ ID NO 


Entrez Accession No. 


Reported Source of Entrez Sequence 


263 


U92818 


Homo sapiens c33.28 unnamed HERV-H 

pi U IC ILL IIUVIN/A. 


264 


AF010316 


Homo saniens Piel2 (PIG 12) mRNA 


265 


X54486 


Human gene for CI -inhibitor 


266 


D64142 


Human mRNA for histone Hlx 


267 


M15330 


Human interleukin 1-beta mRNA 


268 


J02931 


Human placental tissue factor (two forms) 
/ . mRNA 


269 


U48807 


Human MAP kinase phosphatase (MKr-2) 
mRNA 


270 


M27826 


Homo sapiens human endogeneous 
retrovirus RTVL-H neutral protease large 
subunit mRNA 


271 


AL049250 


Homo sapiens mRNA; cDNA 
DKFZp564Dl 13 (from clone 
DKFZp564D113) 


272 


AB002308 


Homo sapiens mRNA for KIAA03 10 
protein 


275 


AI817548 




276 




Human ititerleulcin 8 rkTLR^ -ffene 


278 


Z38026 


H.sapiens mRNA for FALL-39 peptide 
antibiotic 


280 


D49357 


Human mRNA for S-adenosylmethionine 
synthetase 


281 


J02871 


Human lung cytochrome P450 (IV 
subfamily) BI protein 


282 


M55405 


Homo sapiens mucin (MUC-3) mRNA 


283 


W27095 




284 


U64197 


Homo sapiens chemokine exodus- 1 
mRNA 


285 


X52015 


H.sapiens mRNA for interleukin- 1 
receptor antagonist 


286 


X95808 


"LT canipnc ml? "NT A finr rvrv^ffMn f*t\c^c\f\(^.(\ \wj a 

xi.Ddpiciib mxsj>ijfV lor piuiciii cii^uucu uy a 

candidate gene, DXS6673E, for mental 
retardation 


288 


AF027516 


Homo sapiens trans-golgi network 
glycoprotein 51 (TGN) mRNA 



[0041] CCI-779 activity genes represented by each qualifier in Table 2 can be 

identified based on the HG-U95Av2 gene chip annotation provided by Asymetrix. Genes 
thus identified are illustrated in Table 5. CCI-779 activity genes can also be determined 
based on the corresponding Entrez accession numbers. In addition, CCI-779 activity genes 
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can be determined by BLAST searching the corresponding CPSs, or the unambiguous 
segments of the corresponding SEQ ID NOs, against a human genome sequence database. 
Suitable human genome sequence databases for this purpose include, but are not limited to, 
the NCBI human genome database. The NCBI also provides BLAST programs, such as 
"blastn," for searching its sequence databases. 

Table 5. CCI-779 Activity Genes 







Sequences Useful For Making Probes/Primers 
for CCI-779 Activity Genes 


i 


DKFZP564E1962 


SEQ ID NO: 1 


2 


ETR101 


SEOIDNO:2 


3 


PFN1 


SEQ ID NO: 3 


4 


APOBEC1L 


SEQ ID NO: 4 


5 


— «egAH— - - 1 


SEQ ID NO: 5 


i 6 


KIAA0220 


SEQ ID NO: 6 


7 


COROIA 


SEQ ID NO: 7 


8 


GNL1 


SEQ ID NO: 8 


9 


AKR1A1 


SEQ ID NO: 9 


10 


ID2 


SEO ID NOS: 10, 289, and 353 


11 


PSG7 


SEQ ID NO: 11 


13 


IDH2 


SEQ ID NO: 12 


14 


RER1 


SEQ ID NO: 13 


15 


TUBB 


SEQ ID NOS: 14 and 290 


16 


HADH2 


SEO ID NO: 15 


17 


NDUFB5 


SEQ ID NO: 16 


18 


APRT 


SEQ ID NOS: 17 and 310 (Y00486) 


19 


PSME2 


SEQ ID NOS: 18 and 291 


20 


MTCP1 


SEQ ID NOS: 19 and 31 1 (Z24459) 


21 


NDUFB3 


SEO ID NOS: 20 and 312 CA203354) 


23 


CTSL 


SEQ ID NO: 21 


24 


FGL2 


SEQ E) NOS: 22 and 292 


25 


DKFZP564C1940 


SEQ ID NO: 23 | 


26 


28SRNA5 Hs AFFX 


CCA TTV XTfV OA 


27 


SEMA3F 


SEQ ID NO: 25 


28 


BST2 


SEQ ID NO: 26 


29 


UNK AL050021 


SEQ ID NO: 27 


30 


UBE2L6 


SEQ ID NO: 28 


31 


SLC11A1 


SEQ ID NOS: 29 and 354 


32 


CDKN1C 


SEQ ID NOS: 30 and 313 (U22398) 


33 


GAIP 


f SEQ ID NO: 31 


34 


TACC2 


SEQ ID NOS: 32 and 355 


35 


CLNS1A 


SEQ ID NO: 33 


36 


DPYSL2 


SEQ H) NO: 34 


37 


FCGR3A 


SEQ ID NO: 35 


38 


UQCRC1 


T SEQ ID NO: 36 


39 


GZMA 


SEQ ID NO: 37 
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CPS 


CCI-779 Activity Gene 


kSequcnccs useiui jror iviaKing x i uu«»/ a i uuw a 
IOr V^v>I- / iy Acuviiy vjaiw 


41 


PSMB10 


otjA tt\ xtac ie QnH lid Of 71 R74i 
bliQ ID INUo. Jo ana jih io /tj 


42 


IDS 


oiiv J-D inc. 


43 


A T TV T A T/" 

AHNAK 


bEQ ID JNUb: 4U, d ID ^MoUo^f J, <mu jjo 


44 


PTPN7 


OTT A TT\ XTPbC* /11 ortH 90^ 

bisQ ID JNUo. 41 anu Ay d 


45 


DUSP2 


OEAJ ID iNVJ. *tZ 


46 


PSMB9 


bHQ ID INC 


47 


UNK AI652978 


bHQ ID INC *f*r 


48 


UNK AF025529 


/ bliQ ^D INC hD 


49 


SDR1 


1 SEOIDNO:46 


50 


. POLR2L 


SEO ID NO: 47 \ 


51 


SCYA2 


SEQ ID JNUb. 4o ana J.y't 


52 


PAI2 


SEQ ID JNU: 4y 


53 


UNK U66047 T 


SEQ ID JNU: DU 
cfo/^v nr\ xta. c 1 


^ — 
55 


UNK U00930 
JMJ 


SEQ-ID JNU: 2)1 
SEQ ID JNU: DZ 


56 


UGP2 


SEO ID INUo: Di ana do i 


57 


KIAA0410 


SEQ ID JNU: D4 j 


58 


NDUFS7 


SEQ ID NUo: 33 ana 310 va^uud^z^j 


59 


KIAA0645 


bliQ ID JNU: DO 


60 


GSTP1 


SEQ ID JNUb: d /, zyj, ana 31/ iuizh/z; 


61 


DECR1 


SEQ ID JNUo. jo ana Jlo (U /o^uzj i 


62 


PLXNC1 


SEQ ID NOS: 5y ana 3ly (ArUJUj^y; 


63 


TUBA2 


SEQ ID NOS: 60 ana izU ^AJrUUD^yzj 


64 


NME1 


bbQ ID JNU: oi 


65 


UNK AI679353 


bbQ ID JNU. 0/ 


66 


RNAHP 


SEQ ID NUb: oJ ana 3zi ^xioojh-uj 


67 


ZNF198 


cur^ IT* XTfV Ail 
br!/v ID INC Oh 


68 


PRCP 


otSQ ID INC OD 


70 


DKFZP586A0522 


oiiQ ID JNU. OO 


71 


SFRS11 


biiQlDlNU. 0/ _| 


72 


RAC2 


bxlQ ID JNU. Oo 


73 


ZFP103 


bHQ ID JNU. 


74 


LOC51580 


SEQ ID JNUo: /u ana dDo 


75 


NRD1 


btsy iu JNUo. / 1 ana jjy ] 


76 


UNK AF070644 


oiiU WJ INC /z , 


77 


KIAA0906 


orSy lNU. ID 


78 


MT1F 


ocr\ m xrriQ- 7A anH ^79 rrvA1 004.^ 

! oliQ ID INvyO. /H aXLU jZZ ^IVllV/7*tJj 


79 


P85SPR 


ocy iiv INC i j 


80 


• T T\TT/ \T^O f jIT 

UNK N53547 


<3FO TT> NO- 76 


81 


GPRK5 


cca TTI MO^« 77 flnH 906 
IX/ lN\-/o. / / aliiX 


83 


SCYA7 


i rn no* 7R 


OH 


TUB A3 


SEQ ID NO: 79 


85 


SCML2 


i SEQ ID NO: 80 


87 


IL1R1 


SEO ID NO: 81 


88 


UNK AL008729 


SEO ID NO: 82 


. 89 


KIAA0191 


SEO ID NO: 83 
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CPS 


CCI-779 Activity Gene 


Sequences Useful For Making rrobes/Jrrimers 
for CCI-77y Activity oenes 


AA 

90 


EGFL5 


nCA TTV XT/^V» OA 

SEQ ID NU: 84 


91 


TNT TOTJ1 

DUSrl 


SbQ JJJ jnu: o!> 


/"\0 

92 


FBP1 


SEQ ID NOb: 86 and 323 (U2 1931 j 


93 


T TTI TN 

HRB 


nr/\ ttn \TrvC . o*7 «%««<4 OO/I /T /IOA.O<\ j 

SEQ ID NOS: 87 and 324 (LAlv25) 


94 


XT A /~* A 

NAGA 


SEQ ID NO: oo 


95 


GLRX 


SEQ ED NO: 89 


96 


UNK W28281 


SEQ ID NOS: 90 and 3oU 


97 


UNK AL096740 


riT*'/^ TT\ "xt^Nci A - ! IOC /AT AA/COvlA'N J O /T 1 

SEQ ID NOS: 91, 325 (AL096740), and 3ol 


98 


TBXAS1 


OPA TTV \TAO . iNO OO^ /TNI yi C\ OiCO n*->>4 O /CO 

SEQ ID NOS: 92, 326 (D3462j), 362, and 3o3 


99 


DPYD 


SEQ ID NO: 93 


100 


ECGF1 


SEQ ID NOS: 94 and 297 


101 


TIMP1 


SEQ ED NOS: 95 and 327 (Dl 1 139) 


102 


GSN 


SEQ ED NO: 96 


103 


SECTM1 


SEQ IDNO: ^ - ~*- - ■ 


104 


OLR1 


SEQ ED NOS: 98 and 328 (AF079167) 


105 


D6S49E 


SEQ ID NOS: 99 and 364 


106 


JUNB 


SEQ ED NO: 100 


107 


PFC 


SEQ ED NOS: 101 and 329 (AF005664) 


108 


POLR2E 


ctt"»/"n ttn \ta. i /\o 

SEQ ED NO: 102 


109 


111! /"^ ^ 

THBS1 


OTA 11% \TA. 1 AO 

SEQ ED NO: 103 


110 


HK2 


SEQ ID NOS: 104 and 330 (Z46376) 


111 


INSIG1 


SEQ ID NOS: 105, 331 (U96876), 365, and 366 


112 


HCK 


SEQ ID NOS: 106 and 298 


1.13 


HP 10390 


SEQ ID NO: 107 


114 


UNK U51712 


nrA m \ta. 1 no 

SEQ ED NO: 108 


115 


KIAA0217 


SEQ ED NO: 109 


116 


NCF1 


C1T"?/"\ TTN \TA. 1 1 A 

SEQ ED NO: 1 10 


117 


KIAA1113 


SEQ ID NOS: 111 and 367 


118 


DKFZP566H073 


Or7/"\ TTV XTA. 1 1 O 

SEQ ID NO: 112 


119 


ATP2B1 


SEQ ID NOS: 1 13 and 368 


120 


IFITM2 


£"lT?/"\ 1 IN XTA. 1 1 A 

SEQ ID NO: 1 14 \ 


121 


MNDA 


CM~!/~\ TTN XTA. 1 If 

SEQ ID NO: 115 


122 


BTN3A2 


nrjA TTN VTA. 1 1 

SEQ ID NO: 1 16 


123 


KIAA0776 


OTA TTN "XT/~\. 1 1 H 

SEQ ID NO: 117 


124 


D6S2245E 


<"IT"»a'"\ TTN XTA. 1 1 O 

SEQ ED NO: 118 


125 


KIAA0628 


OTA TTN TWT/^k. 1 1 A 

SEQ ED NO: 119 


126 


SELL 


OT?/^ TTN *X.T/*\. 1 OA 

SEQ ED NO: 120 


127 


MPV17 


OCA TTN XTA. 1 Ol 


128 


T7 A Trnr 

FABP5 


SEQ ID NUb: izz ana 33z (jviy^ojoj 


129 


CASPl 


APA TTN \TAQ, 1 01 or\A OQO 

ohrQ eu JNUo. izj ana zyy 


130 


PTPN6 


O T7/^i TT\ XTO« 1 OA 


1 31 

101 




SEO IDNO # 125 


133 


GR02 


SEO ID NO: 126 


134 


RRAS 


SEO ID NO: 127 


135 


KIAA0022 


SEO ED NOS: 128 and 369 


136 


FLn 


SEO ID NOS: 129 and 333 (U80184) 
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CPS 


CCI-779 Activity Gene 


ocquenCes useiui for iviaKiJig jrruuca/x uiucrs 


1 Q7 
13 / 


xlvT 1 


oliU JUL' JNUo. uu aiivx J j" ^lviZrOj^jy 


1 

loo 


VTA Am 09 


cr?n TD NO- 1^1 

OJ-/U Ai-/ Ul 


1 3y 


T JK\V A A 1 7A7BO, 
UJNJv AAl/O/oU 


qtja xn no* 1^9 


1 Af\ 

140 


r YU.L 


QT2n TT^ XTT^C' 1 *M onH f AT? 0467 Q PA 

o.EA^ 1U JNUo. Ijj dnu jjj ^rvrv/H-o /yoj 


1 A 1 

141 


Dor 


OTJO TTI MH 1 1 
Oiiv 11-' 1NL/. 1 DH 


1 A1 

143 


KANBrzLl 


oun tfi vn. i 

oliy ll^ 1N\J. 1 DD 


1 A A 

144 


IJNror 1U 




1 A C 

145 




/ oxiv 11^ iNw. 1j/ 


146 


SDS 


i oiiv ID JNUo. i jo ana o /u 


147 


KIAAUj to 


Oliv JNU. 1 J7 


i in 

148 


rCOK3r> 




149 


T7/^>T7T> 1 A 


oliV^ ID Fi\J. l*tl 


1 c £\ 

150 


CD44 


ohKi iu iNUo. i4z ana jjo ^iajdh^) 


~ 151 


^IDl^-^< - ^- 


b.bv^ 11/ JNUo. 14j ana juu 


152 


UNK AI760801 


hcX^ ID JNU. 144 


153 


A TVI *L50 

ADTB2 


oJiy JLD JNU. 14D 


•* C A 

154 


TV jT A TNTT'} 

MADH3 


b.cv2 ad JNUo! 140 ana oui 


•ICC 

155 


ARPC1B 


oliy ID JNU. l*f / 


156 


TAT 

TALDOl 


oJby ID JNUo. 145, j j 1 ^/Vruiu*fuuj, ana o / i 


157 


UNK AL035U79 


ojpy jld jnu. my 


158 


CTVDD1 

blXBPl 


O-tiv IJ-' JNU. lD\J 


159 


UNG2 


oiiv ID JNU. 1D1 


160 


T"\T TO TIC 

DUSP5 


CTjn T"n xirv 1 <;9 

OXiv ID JNU. 1DZ 


161 


PTPRO 


oJly ID INU. 103 


162 


OT A 

SLA 


oliv ad JNUo. ioh ana ouz 


163 


RFP2 


oliv AD INU. 1DD ! 


164 


A T\T" , TI 

ADFP 


oJiv^ id JNUo. iDo ana j /z 


•* f c 

165 


A TV 1 'V> 

ADTD 


oiiv^ id JNUo. id i ana d id 


166 


a T\ r r/™ 1 

ADICj 


oHv I-*-' JNU. IJO 


168 


T TKTX7" voto a a 

UNK X87344 


oliv AD JNUo. i oy ana djo \a\o /dhh) 


170 


/-i a ryrz 
CAtvj 




171 


a tit t 
AKL/ 


QThH TFi "KTO* 1 61 


172 


KuVl 


^T?n TD >JO- 1 #t9 


173 






175 


UJNK ArU/UoUo 


I QPO TH NO* 1 #vi 


176 


An ATT 

AUAJri 


<5PO TTI "NFfV 1 6S 
oily 1±J iH\J. 1UJ 


177 


T TXTV A J^OA A A^ 

UNK AJ224442 


o£^v A-* - ' JNU. 1DD 


1 TO 

178 


MoKl 




179 


XAr4 


OX2/V< JJ-/ INU. luO 


1 OA 

180 


r^i rM5 
i^lv^xC 


^FO TD NO' 1 6Q 


lol 




SFO TD NO* 170 


182 


UNK AL049309 


SEQIDNO: 171 


183 


RNASE2 


SEQ ID NO: 172 


184 


NCF4 


[ SEO ID NOS: 173 and 303 


185 


ANXA4 


SEQ ID NO: 174 


186 


ME2 


SEQ ID NOS: 175 and 339 (M55905) 
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CPS 


CCI-779 Activity Gene 


Sequences Useful For Making rroDes/iriniers 
for CO- 11 y Activity Irenes 


187 


IFI35 


SEQ ID NU: 1 /o 


188 


VAMP5 


SEQ ID JNU: ill; 


189 


IMPA2 


SEQ ID JNU: 1 /o 


190 


GZMK 


SEQ ID JNU: 1 /y 


191 


BGN 


SEQ ID NOS: 180 and 340 (J045yy) 


192 


AIF1 


SEO ID NU: 181 


193 


CBP2 


SEQ ID JNU: loz 


195 


IOGAP1 


SEQ ID NU: 183 


196 


KIAA0823 


SEQ ID NU: 184 


197 


UNK AI018098 


SEQ ID NOS: 185 and 374 


198 


CD163 


SEQ ID NO: 186 


199 


HK3 


SEQ ID NO: 187 


200 


FOS 


SEQ ID NO: 188 


- 201" ■ 


DIFF48 


SEQ ID NO: 189 


202 


UNK L23134 


SEQ ID NO: 190 


203 


LM02 


SEQIDNO:l91 _J 


204 


PDI2 


SEQ ID NO: 192 


205 


UNK AL031282 


SEQ ID NO: 193 


206 


GABRG2 


SEO ID NO: 194 _j 


207 


KIAA0404 


SEQ ID NO: 195 


208 


UNK U12471 


SEQ ID NO: 196 


209 


IGHA1 


SEQ ID NOS: 197, 304, and 305 


210 


ABCB1 


SEO ID NO: 198 


211 


NRIP1 


SEQ ID NO: 199 


213 


GLB1 


SEQ ID NO: 200 


214 


MSC 


SEQ ID NO: 201 


216 


UNK U95006 


SEQ ID NO: 202 


217 


f NK4 


SEQ ID JNU: ^03 


218 


UNK Z24724 


SEQ ID NO: 204 


219 


GRN 


SEQ ID NO: 205 


220 


EREG 


SEQ ID NO: 206 


221 


KIAA0382 


SEQ ID NOS: 207 and 341 (AB002380J 


222 


RNASE6 


SEQ ID NO: 21)8 


223 


KIAA0442 


SEQ ID NOS: 209, 342 (Atf00/90z), ana o /o 


225 


18SRNA3 Hs AFFX 


SEQ ID NO: 210 


226 


UNK U82303 


CiT?/"\ TT"\ XT/*^» Ol 1 

SEQ ID NO: 211 


227 


ARHC 


SEQ ID NO: 212 


228 


CREM 


SEQ ID NOS: 213, 30o, ana 343 ibooi^j 


229 


ISG15 


SbQ ID JNU: Z14 


230 


I CCR7 


SEQ ID NUb: 215 ana 344 \l,d ido**) 


231 


Fll 


SEQ ID NUb: zlo ana 3 /o 


2.51 


TTNK AT68741Q 


SEQ ID NO: 217 


233 


UNK AC002073 


SEQ ID NOS: 218, 345 (AC002073\ and 377 


234 


STK17A 


SEQ ID NO: 219 


235 


BLVRA 


T ^ SEQ ID NO: 220 


236 


SPP1 


SEQ ID NOS: 221 and 307 
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CPS 


CCI-779 Activity Gene 


Sequences Usetul ror JVlaKing r roues/ jrriiners 
lor CC1- / /5f Acuviiy Irenes 


237 


CTV1 A 

SlAlA 


CCA TT^ MH* 000 
bliQ ID INU. ZZZ 


238 


Tl/TT 1 

1K1L1 


nrn tt\ xjfV 00^ 
oHQ JLU JNU. ZZO 


AAA 

239 


1EK3 


crn -rn "wo* oo*l 
bHQ JU-J JNU. ZZH- 


241 


TIT ATT 

PLAU 


OTJA TT> XTfV 00^, 

bliQ 1JJ iNU. ZZJ 


242 


A OO 

ASS 


OCA TT\ XTfV 00£ 

!S.bQ 1JJ JNU. ZZO 


244 


T%T TV 1 

DUXl 


nrjA Tpw XT/"V OOO 

seq iNu. zz/ 


245 


GPA33 


bHQ iiJ JNU. ZZo 


246 


PAI1 


i CCA TT* XTA* OOO 

/ SEQ ID INU. zzy 


247 


CYP1B1 


1 SEQ ID NUb: Zd\) ana 3Uo 


249 


HU-K5 


nrA m\TAC. 031 o-m^ QOC 

SEQ ID NUb: zil ana j /o 


250 


DORA 


CTJA TT\ XTA- OlO 

SEQ ID NU. z3z 


253 


MD-2 


OCA T"Tl xt/"v 0^1 

SEQ ID JNU. 2.3 d 


254 


UNK U07563 


otja TT\ XTA. Ol A 

SEQ ID JNU: z34 


' 255 


SPINK1 -~- 


OCA TT\ XTA. OK 

. , SEQ ID JNU: ziD . 


256 


UNK S82470 


crA m XTA. 0*2£ 

SbQ ID JNU. Z30 


257 


IGHG3 


OCA TT\ XTA. OOO 

SEQ ID JNU: 15 1 


258 


UNK AA151971 


npA TTN "XT AO- AO O O A A 1 AC f A A 1^1 OT 1 ^ ot-l/1 'iOQ 

SEQ ID NOS: 238, 309, 346 (AAlMy /lj, ana 3/y 


259 


UNK AA2 16639 


rtrifv TT\ XT AO. A nM >4 O A *7 / A A01 Q\ 

SEQ ID NOS: 239 ana 34/ (AAzioojyj 


260 


TNFRSF12 


CT7A TTA XTA. O A A 

SEQ ID NO: 24U 


261 


UNK AL031685 


OT7A TA XTA. AA 1 

SEQ ID JNU: z4I 


262 


UNK AL049963 


OT7A TT\ XTA. O A O 

SEQ ID JNU: z4z 


263 


APELIN 


nrjA TT\ XTA. O A 1 

SEQ ID NU: 243 


264 


VEGF 


nrjA TT\ XTA. AAA 

SEQ ID NU: z44 


_ 265 . 


ELAVL2 


CCA 1T\ XTA. OA^ 

SEQ IJJ NU: Z4D 


266 


PPP2R2A 


CT2A TT\ XTA. O/K 

SiiQ IJJ NU: z4o 


267 


UNK AF001862 


OT7A TTw XT AC* OAT avis! IQft 

SEQ ID NUS. z4/ ana joU 


268 


0AS1 


CD A TTV XTr\. O/IC 

SEQ ID NU: z4o 


269 


CYBB 


CUA TT\ XTA. O AO 

SbQ id nu: z4y 


270 


GFPT2 


CTJA TTN XTA. O CA 

SliQ JLU JNU, ZDU 


271 


BNIP3 


CD A TT\ XTA. A<1 

SJiQ 1U JNU: zjI 


272 


BST1 


CDA TT\ XTA* A*rO 

SiiQ IU JNU. ZDz 


273 


UNK U50535 


CtjA irv \TO« 0^^ 
[ oHQ ID JNU. ZD j 


274 


CX3CR1 


CCA in XTA. AC A 

oXiQ JLU XNU. ZD4 


275 


CKTSF1B1 


| briQ IU INU. ZDD J 


276 


VNN2 


CCA TCV XTA, O^A 

OJiQ 1U XNU. ZDO 


277 


KIAA0064 


OCA TT\ XTAQ* O^O anH 

oJiu lu JNUo. zo / ana joi 


278 


GR01 


f CPA TT\ XTA. O<0 

SiiQ 11/ JNU. ZDo 


279 


KIAA0655 


CCA TT^ XTA. ACQ 

oJtlQ 1U xnu. zoy 


280 


RBI 


OCA TTV XTO* 0A0 

oHQ -»»U INU. ZOU 


281 


KIAA0073 


OCA TT\ XTA. Azri 

oHy IX-/ 1N\J. ZOl 


282 


yt A TVKTA 

CAPN4 


<3pn tt) r\rn« o^o 

oEy IX-/ IN^. ZOZ 


ZO J 


1 UNK U92818 


SEQ ID NO: 263 


284 


MGST1L1 


j SEQ ID NO: 264 


285 


C1NH 


SEOIDNO:265 


286 


H1FX 


SEQ ID NO: 266 


. 287 


IL1B 


SEOIDNO:267 
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CPS 


CCI-779 Activity Gene 


sequences useiui nor iviaiung r roDes/ri liners 
ior v^v^i- / Activity vvenes 


98R 

ZOO 


F^ 


oDl^ iXJ JNU. ZOO 


980 
zoy 




CDfi Tn XT/V 7£Q 


900 


HTTMPTVT FTC 


0£\l id 1NVJ, z/u 


901 

Z7l 


TTNTK AT 0409 SO 


cT7n m tvt/v 771 


909 

Z7Z 


TTMK AF*0091flR 


cT?n rn xrn* 979 

oiiV ^ xNVA z/z 


90^ 


ftynh 


oJDl^ XU JNUo. Z/O alia j^o ^JUJUUoJ 


905 


Fft AT9 
rivrv X Z 


crn rn xrrvQ* 774 iaq f Avn&yrxQ} 

• oIjv I-'-' INUo. Z/H aiiU jny ^AruOZ./j^J 


906 


TTNTK' AT81754R 
Ul>iv rYXO 1 /J*+o 


Qi?n jt\ xrnQ- 77^ on/i 1S7 

oHy LU INL/o. Z/j alia joZ 


907 
zy / 


TT 8 


on*Kl ID INvJ. Z/O 


90S 


r ov^jnz 


C 17 f\ Tn XTf^CS OT7 f A T1 COA01 , \ on/I 1C5 

oJtiv^ ID JNvJo. z / /, jDU (Alloyozl ) 9 ana jod 


inn 


t^AIVLr 


oliV^ LU INU: z/o 




r^OJvl A 


obv*) lu JNUo: z/y ana 3M (MoJooS) 


JUZ 


JV1A 1 1A 


o£2ri^ JJJ lNUo. zoU ana 3o4 


D\)D 


rVD/1D1 


* • 5 ..... . bbQ ID InU. zqI 




MUU 


csr,Kl ID JNU. Zoz 




T*7 


oiii^ 1JLF INU. Zoj 


JVv 


SPVA90 


9FO TTj NO* 9R4 


307 


IL1RN 


SEQIDNO:285 


308 


ZNF261 


SEQIDNO: 286 


309 


LGALS2 


SEQ ID NOS: 287, 352 (AL022315), and 385 


310 


TGN51 


SEQ'IDNOS: 288 and 386 



[0042] In one embodiment, the BLAST search of the NCBI human genome database 

is conducted by using CPSs. Gene(s) that aligns to a given CPS with at least 95% sequence 
identity can be identified. In many cases, the identified gene(s) has at least 96%, 97%, 98%, 
99%, or more sequence identity with the CPS. The results of the BLAST search are 
detailed below. 

[0043] CPS 1 corresponds to DKFZP564E1962 (TXNDC) which encodes 

thioredoxin domain-containing. This gene has LocusID: 81542, and is located on 
chromosome 14 with reported cytogenetic location 14q21.3. The gene resides in genomic 
locus NT_025892 (NCBI Genome Annotation). The gene product is a member of the 
thioredoxin family. 

[0044] CPS 1 also has 86-90% sequence identity with an intron sequence of GK003 

which encodes GK003 protein. GK003 has LocusID: 57002, and is located on chromosome 
7 with reported cytogenetic location 7pl5.2. In addition, fragments of CPS 1 align with a 
chromosomal region on chromosome 13 and an intron sequence of OATPRP4 with 87-95% 
sequence identity. The chromosomal region on chromosome 13 is located near ING1 which 
encodes inhibitor of growth family, member 1, and has LocusID: 3621. OATPRP4 encodes 
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organic anion transporter polypeptide-related protein 4, and has LocusID: 81796 with 
reported cytogenetic location 8ql3.1. 

[0045] CPS 2 corresponds to ETR101 which encodes immediate early protein. This 

gene has LocusID: 9592, and is located on chromosome 19 with reported cytogenetic 
location 19pl3.13. The gene resides in genomic locus NT_031915 (NCBI Genome 
Annotation). Expression of immediate early protein can be induced by TP A stimulation in 
promyelocyte leukemia cell line HL-60 and in other, leukemia cell lines. 
[0046] A fragment of CPS 2 (nucleotides! 309 to 492 of CPS 2) shows 98% 

sequence identity with an intron sequence of LOCI 69782. LOG 169782 encodes a protein 
similar to transcription factor IIIA (Factor A) (TFIIIA), and has reported cytogenetic 
location 9p24.1. 

[0047] CPS 3 corresponds to PFN1 which encodes profilin 1. This gene has 

LocusID: 5216, and is located on chromosome 17 with reported cytogenetic location 
17pl3.3. The gene resides in genomic locus NT_033299 (NCBI Genome Annotation). 
Profilin 1 is a ubiquitous actin monomer-binding protein belonging to the profilin family. It 
is thought to regulate actin polymerization in response to extracellular signals. Deletion of 
PFN1 gene is associated with Miller-Dieker syndrome. 

[0048] Nucleotides 7 to 406 of CPS 3 align with various regions in the human 

genome with 86-93% sequence identity. These regions include LOC163511, COAS3, a 
region near LOCI 490 10, a region near LOC200030, an intron sequence of 
DKFZp434D177, FLJ20719, LOCI 99970, and LOC206456. LOC163511 encodes a 
protein similar to profilin I, and has reported cytogenetic location lq23.2. COAS3 encodes 
chromosome 1 amplified sequence 3, and has LocusID: 200025 with reported cytogenetic 
location lql2. LOC149010 encodes a protein similar to hypothetical protein 
DKFZp434D177, and has reported cytogenetic location lql2. LOC200030 encodes a 
protein similar to hypothetical protein DJ328E19.C1.1, and has reported cytogenetic 
location lql2. DKFZp434D177 encodes hypothetical protein DKFZp434D177, and has 
LocusID: 84224 with reported cytogenetic location lp36.12. FLJ20719 encodes 
hypothetical protein FLJ20719, and has LocusID: 55672 with reported cytogenetic location 
lp31. LOC199970 has reported cytogenetic location lpll.l. LOC206456 encodes a 
protein similar to chain P, structure of bovine beta-actin-profilin complex with actin bound 
Atp phosphates solvent accessible. LOC206456 is located on chromosome 6. 
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[0049] CPS 4 corresponds to APOBEC1L (APOBEC3C) which encodes a protein 

similar to APOBEC1 protein. APOBEC1L gene has LocusID: 27350, and is located on 
chromosome 22 with reported cytogenetic location 22ql3.1-ql3.2. APOBEC1L gene 
product is similar to phorbolin (DJ742C19.2), and may catalyze hydrolytic deamination of 
cytidine nucleotides. The gene product contains a cytidine deaminase zinc-binding domain. 
[0050] CPS 4 also has 91% sequence identity with LOC200316 which encodes a 

protein similar to phorbolin 3 (APOBECl-like). LOC200316 has LocusID: 200316 with 
reported cytogenetic location 22ql3.1. In addition, CPS 4 aligns with two other regions on 
chromosome 22 with 89-92% sequence identity. 

[0051] CPS 5 corresponds to CKAP1 which encodes cytoskeleton-associated 

protein 1. This gene has LocusID: 1155, and is located on chromosome 19 with reported 
cytogenetic location 19ql3.11-ql3.12. The gene resides in genomic locus NTJ)11296 
(NCBI Genome Annotation). Cytoskeleton-associated protein 1 associates with 
microtubules. It contains a glycine domain which plays a role in association with 
microtubules. 

[0052] Affymetrix annotation suggests that CPS 6 corresponds to KIAA0220 which 

has LocusID: 23117 and is located at chromosome 16pl2.1. Blast search of the Entrez 
human genome sequence database shows that CPS 6 has 99% sequence identity to a region 
on chromosome 16. This region is near LOC255565, and resides in genomic locus 
NTJ)35368 (NCBI Genome Annotation). 

[0053] In addition, fragments of CPS 6 align with various chromosomal regions 

with at least 90% sequence identity! These regions include LOC220555, LOC124302, a 
chromosomal region near LOC220567, a chromosomal region near LOC254081, and a 
chromosomal region near LOC197363. LOC220555 encodes a protein similar to nuclear 
pore complex interacting protein, and has reported cytogenetic location 16pll.2. 
LOC124302 encodes a protein similar to nuclear pore complex interacting protein, and has 
reported cytogenetic location 18pll.l. LOC220567 encodes a protein similar to 
apolipoprotein B48 receptor, and is located at chromosome 16ql3. LOC254081 encodes a 
protein similar to group X secretory phospholipase A2 precursor (phosphatidylcholine 2- 
acylhydrolase GX) (GX sPLA2) (sPLA2-X), and is located on chromosome 16. 
LOCI 97363 encodes a protein similar to ataxin 2 related protein (isoform 1), and is located 
on chromosome 16. 
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[0054] CPS 7 corresponds to COROIA which encodes coronin, actin binding 

protein, 1A. This gene has LocusID: 11151, and is located on chromosome 16 with 
reported cytogenetic location 16ql3. The gene resides in genomic locus NTJ333291 (NCBI 
Genome Annotation). The gene product (coronin 1A) binds to actin, and may be involved 
in mitosis, cell motility, formation of phagocytic vacuoles and phagocytosis. Coronin 1A 
has at least five WD domains. 

[0055] CPS 8 corresponds to GNL1 which encodes guanine nucleotide binding 

protein-like 1 . This gene has LocusID: 2794, and is'located on chromosome 6 with reported 
cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI 
Genome Annotation). The GNL1 gene, identified in the human major histocompatibility 
complex class I region, shows a high degree of similarity with its mouse counterpart. The 
GNL1 gene is located less than 2 kb centromeric to HLA-E, in the same transcriptional 
orientation. GNL1 is telomeric to HLA-B and HLA-C. 

[0056] CPS 9 corresponds to AKR1A1 which encodes aldo-keto reductase family 1, 

member Al (aldehyde reductase). This gene has LocusID: 10327, and is located on 
-chromosome 1 with reported cytogenetic location Ip33-p32. The gene resides in genomic 
locus NT_032972 (NCBI Genome Annotation). Aldehyde reductase (aldo-keto reductase 
family 1, member Al) reduces carbonyl-containing substrates, and may metabolize 
xenobiotics. It is a NADPH-dependent member of the aldo-keto reductase superfamily. 
[0057] CPS 10 corresponds to ID2 which encodes inhibitor of DNA binding 2, 

dominant negative helix-loop-helix protein. This gene has LocusID: 3398, and is located. on 
chromosome 2 with reported cytogenetic location 2p25. The gene resides in genomic locus 
NT_005334 (NCBI Genome Annotation). The gene product is a member of the Id helix- 
loop-helix family of proteins, and may negatively regulate cell differentiation. 
[0058] CPS 10 also has 95% sequence identity with an intron sequence of PTPRG. 

PTPRG encodes protein tyrosine phosphatase, receptor type, G, and has LocusID: 5793 
with reported cytogenetic location 3p21-pl4. The protein encoded by PTPRG gene is a 
member of the protein tyrosine phosphatase (PTP) family. PTPs are known to be signaling 
molecules that regulate a variety of cellular processes including cell growth, differentiation, 
mitotic cycle, and oncogenic transformation. The PTP encoded by PTPRG gene possesses 
an extracellular region, a single transmembrane region, and two tandem intracytoplasmic 
catalytic domains, and thus represents a receptor-type PTP. The extracellular region of this 
PTP contains a carbonic anhydrase-like (CAH) domain, which is also found in the 
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extracellular region of PTPRBETA/ZETA. PTPRG gene is located in a chromosomal 
region that is frequently deleted in renal cell carcinoma and lung carcinoma, and thus is 
thought to be a candidate tumor suppressor gene. 

[0059] In addition, CPS 10 aligns with a chromosomal region near LOCI 40282 with 

88% sequence identity. .LOC140282 encodes a protein similar to translationally controlled 
tumor protein (TCTP) (p23) (Histamine-releasing factor) (HRF), and has reported 
cytogenetic location 2 1 q2 1 .3 . 

[0060] Affymetrix annotation suggests that CPS 11 corresponds to PSG7 which 

encodes pregnancy specific beta- 1 -glycoprotein 7 and has LocusID: 5676. PSG7 is located 
at chromosome 19ql3.2. 

[0061] Blast search of the Entrez human genome sequence database shows that CPS 

11 aligns with a region 3' to PSG1 with at least 98% sequence identity. PSG1 encodes 
pregnancy specific beta- 1 -glycoprotein, and has LocusID: 5669 with reported cytogenetic 
location 19ql3.2. The gene product is a member of the pregnancy-specific glycoprotein 
(PSG) and CEA families. 

[0062] Moreover, CPS 1 1 has 92-95% sequence identity with various regions on 

chromosome 19. They include PSG2, PSG9, a region near PSG3, and a region near PSG7. 
PSG2 encodes pregnancy specific beta- 1 -glycoprotein 2, and has LocusID: 5670 with 
reported cytogenetic location 19ql3.1-ql3.2. PSG9 encodes pregnancy specific beta-1- 
glycoprotein 9, and has LocusID: 5678 with reported cytogenetic location 19ql3.2. PSG3 
encodes pregnancy specific beta- 1 -glycoprotein 3, and has LocusID: 5671 with reported 
cytogenetic location 19ql3.2. PSG7 encodes pregnancy specific beta- 1 -glycoprotein 7, and 
has LocusID: 5676 with reported cytogenetic location 19ql3.2. 

[0063] CPS 13 corresponds to IDH2 which encodes isocitrate dehydrogenase 2 

(NADP+), mitochondrial. This gene has LocusID: 3418, and is located on chromosome 15 
with reported cytogenetic location 15q26.1. The gene resides in genomic locus NT_033276 
(NCBI Genome Annotation). Mitochondrial NADP(+)-specific isocitrate dehydrogenase 2 
can decarboxylate isocitrate into alpha-ketoglutarate. 

[0064] CPS 14 corresponds to RER1 which encodes a protein similar to S. 

cerevisiae RER1. This gene has LocusID: 11079, and is located on chromosome 1 with 
reported cytogenetic location lpter-q24. The gene resides in genomic locus NT_004350 
(NCBI Genome Annotation). 
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[0065] CPS 15 corresponds to TUBB which encodes tubulin, beta polypeptide. This 

gene has LocusID: 7280, and is located on chromosome 6 with reported cytogenetic 
location 6p21.3. The gene resides in genomic locus NT_034880 (NCBI Genome 
Annotation). Beta-tubulin polymerizes to form microtubules, and is a member of a family 
of structural proteins. 

[0066] CPS 16 corresponds to HADH2 which encodes hydroxyacyl-Coenzyme A 

dehydrogenase, type II. This gene has LocusID: 3028, and is located on chromosome X 
with reported cytogenetic location Xpl 1.2. The gene resides in genomic locus NT_01 1799 
(NCBI Genome Annotation). The gene product can bind to amyloid-beta peptide, 
[0067] CPS 17 corresponds to NDUFB5 which encodes NADH dehydrogenase 

(ubiquinone) 1 beta subcomplex, 5 (16kD, SGDH). This gene has LocusID: 4711, and is 
located on chromosome 3 with reported cytogenetic location 3q27.1. The gene resides in 
genomic locus NT_022396 (NCBI Genome Annotation). The gene product is a subunit of 
NADH-ubiquinone oxidoreductase (complex I). It can transport electrons from NADH to 
ubiquinone. 

[0068] CPS 17 also shows 77% sequence identity with a chromosomal region near 

LOC205833. LOC205833 has reported cytogenetic location 4q34.3. 
[0069] CPS 18 corresponds to APRT which encodes adenine 

phosphoribosyltransferase. This gene has LocusID: 353, and is located on chromosome 16 
with reported cytogenetic location 16q24. The gene resides in genomic locus NT_010404 
(NCBI Genome Annotation). Adenine phosphoribosyltransferase belongs to the 
purine/pyrimidine phosphoribosyltransferase family. This enzyme catalyzes the formation 
of AMP and inorganic pyrophosphate from adenine and 5-phosphoribosyl-l -pyrophosphate 
(PRPP). It can also produce adenine as a by-product of the polyamine biosynthesis 
pathway. A homozygous deficiency in this enzyme may cause 2,8-dihydroxyadenine 
urolithiasis. 

[0070] CPS 19 corresponds to PSME2 which encodes proteasome (prosome, 

macropain) activator subunit 2 (PA28 beta). This gene has LocusID: 5721, and is located 
on chromosome 14 with reported cytogenetic location 14qll.2. The gene resides in 
genomic locus NT_025892 (NCBI Genome Annotation). 

[0071] CPS 19 also aligns with LOC220462 with 97% sequence identity. 

LOC220462 encodes a protein similar to proteasome activator complex subunit 2 
(Proteasome activator 28-beta subunit) (PA28beta) (PA28b) (Activator of multicatalytic 
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protease subunit 2) (11S regulator complex beta subunit) (REG-beta). LOC220462 has 
reported cytogenetic location 13ql4.11. In addition, CPS 19 has about 90-96% sequence 
identity with LOC257093, LOC 166868, and an intron sequence of LOC152940. 
LOC257093 encodes a protein similar to proteasome activator complex subunit 2 
(Proteasome activator 28-beta subunit), and is located on chromosome 5. LOC 166868 
encodes a protein similar to PA28beta, and has reported cytogenetic location 4pl4. 
LOCI 52940 has reported cytogenetic location 4q31.3-q32.1. Furthermore, fragments of 
CPS 17 show 86-97% sequence identity with LOC220220 and LOC206704. LOC220220 
encodes a protein similar to PA28beta, and has reported cytogenetic location 10pll.23. 
LOC206704 also encodes a protein similar to PA28beta, and is located on chromosome 
8p21.1. 

[0072] CPS 20 corresponds to MTCP1 which encodes mature T-cell proliferation 1. 

This gene has LocusID: 4515, and is located on chromosome X with reported cytogenetic 
location Xq28. The gene resides in genomic locus NT_025965 (NCBI Genome 
Annotation). The gene product may be involved in the leukemogenic process of mature T 
cell proliferation. 

[0073] CPS 21 corresponds to NDUFB3 which encodes NADH dehydrogenase 

(ubiquinone) 1 beta subcomplex, 3 (12kD, B12). This gene has LocusID: 4709, and is 
located on chromosome 2 with reported cytogenetic location 2q31.3. The gene resides in 
genomic locus NT_005370 (NCBI Genome Annotation). The multisubunit 
NADH:ubiquinone oxidoreductase (complex I) is an enzyme complex in the electron 
transport chain of mitochondria. NDUFB3 gene product is a subunit of NADH-ubiquinone 
oxidoreductase (complex I), and can transport electrons from NADH to ubiquinone. 
[0074] CPS 21 also has about 96% sequence identity with NDUFB3P4. 

NDUFB3P4 encodes NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 3 (12kD, 
B12) pseudogene 4. It has LocusID: 93995, and is located on chromosome 14. The gene is 
located in an intron of WARS which encodes tryptophanyl-tRNA synthetase and has 
LocusID: 7453 with reported cytogenetic location 14q32.31. 

[0075] Moreover, CPS 21 has about 86-91% sequence identity with NDUFB3P3, 

and intron sequence of KIAA0893, and an intron sequence of PPP6C. NDUFB3P3 encodes 
NADH dehydrogenase (ubiquinone) 1 beta subcomplex, 3 (12kD, B12) pseudogene. 3. It 
has LocusID: 93996, and is located on chromosome 14. KIAA0893 encodes KIAA0893 
protein, and has LocusID: 22911 with reported cytogenetic location lpl3.2. PPP6C 
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encodes protein phosphatase 6, catalytic subunit, and has LocusID: 5537 with reported 
cytogenetic location 9q34.1 1 . 

[0076] CPS 23 corresponds to CTSL which encodes cathepsin L. This gene has 

LocusID: 1514, and is located on chromosome 9 with reported cytogenetic location 9q21- 
q22. The gene resides in genomic locus NT_023935 (NCBI Genome Annotation), The 
protein encoded by CTSL gene is a lysosomal cysteine proteinase that plays a role in 
intracellular protein catabolism. Its substrates include collagen and elastin, as well as alpha- 
1 protease inhibitor, a major controlling element of neutrophil elastase activity. The 
encoded protein has been implicated in several pathologic processes, including myofibril 
necrosis in myopathies and in myocardial ischemia, and in the renal tubular response to 
proteinuria. This protein is a member of the peptidase CI family. At least two transcript 
variants encoding the same protein have been found for CTSL gene. 
[0077] Fragments of CPS 23 have 84-88% sequence identity with LOCI 18945, 

LOCI 19215, and LOC219343. LOCI 18945, LOCI 19215, and LQC219343 encode 
proteins similar to cathepsin L precursor (Major excreted protein) (MEP), and are located at 
chromosome 10q23.32, 10q21.1 and 10q23. 2, respectively. 

[0078] Affymetrix annotation suggests that CPS 24 corresponds to FGL2 which 

encodes fibrinogen-like 2 and has LocusID: 10875. FGL2 has the reported cytogenetic 
location at chromosome 7ql 1 .23. 

[0079] Blast search of the Entrez human genome database shows that CPS .24 aligns 

with the non-protein-coding strand of KIAA1505 with at least 98% sequence identity. 
KIAA1505 encodes KIAA1505 protein, and has LocusID: 57639 with reported cytogenetic 
location 7pl2.3. KIAA1505 resides in genomic locus NT_007933 (NCBI Genome 
Annotation). 

[0080] Affymetrix annotation suggests that CPS 25 corresponds to 

DKFZP564C1940 which is also known as LRP10. LRP10 encodes low density lipoprotein 
receptor-related protein 10, and has LocusID: 26020. LRP10 is located at chromosome 
Mqll.l. 

[0081] Blast search of the Entrez human genome sequence database shows that 

fragments of CPS 26 have 84-98% sequence identity with various regions on chromosomes 
1, 2, 4, 5, 7, 9, 10, 11, 12, 17, 18, X, or Y. For instance, nucleotides 164-283 and 359-589 
have 95% sequence identity with an intron sequence of LOC121292. LOC121292 is 
located at chromosome 12q24.31, and resides in genomic locus NT_035241 (NCBI Genome 
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Annotation). Nucleotides 253-323 and 359-589 of CPS 26 align with a region near 
LOC138924 with at least 95% sequence identity. LOC138924 encodes a protein similar to 
peptidyl-Pro cis trans isomerase, and is located at chromosome 9q21.31 and in genomic 
locus NTJ)08580 (NCBI Genome Annotation). Nucleotides 374-588 of CPS 26 have 93% 
sequence identity with a region between DIM1 and FLJ21172. DIM1 encodes a protein 
similar to S. pombe diml+, and has LocusID: 10907 with reported cytogenetic location 
18q23. FLJ21172 encodes hypothetical protein FLJ21172, and has LocusID: 79863 with 
reported cytogenetic location 18q23. Nucleotides 634-1116 of CPS 26 have 86% sequence 
identity with an intron sequence of MYOID. MYOID encodes myosin ID, and has 
LocusID: 4642 and reported cytogenetic location 17qll-ql2. 

[0082] CPS 27 corresponds to SEMA3F which encodes sema domain, 

immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3F, This gene has 
LocusID: 6405, and is located on chromosome 3 with reported cytogenetic location 3p21.3. 
The gene resides in genomic locus NT_006014 (NCBI Genome Annotation). The 
semaphorins are a family of proteins that are involved in signaling. A typical family 
member has a secretion signal, a 500-amino acid sema domain, and 16 conserved cysteine 
residues. Sequence comparisons have grouped the secreted semaphorins into 3 general 
classes. Members of the semaphorin IE family, including human semaphorin m, chicken 
collapsin, and mouse semaphorins A, D, and E, have a basic domain at the C terminus. 
SEMA3F gene product is a secreted member of the semaphorin M family. 
[0083] CPS 28 corresponds to BST2 which encodes bone marrow stromal cell 

antigen 2. This gene has LocusID: 684, and is located on chromosome 19 with reported 
cytogenetic location 19pl3.2. The gene resides in genomic locus NTJ)11295 (NCBI 
Genome Annotation). Bone marrow stromal cells are involved in the growth and 
development of B-cells. Bone marrow stromal cell antigen 2 may play a role in pre-B-cell 
growth and in rheumatoid arthritis. 

[0084] Blast search of the Entrez human genome sequence database shows that CPS 

29 overlaps or includes a chromosomal region between SLC7A1 and KIAA0774. SLC7A1 
encodes solute carrier family 7 (cationic amino acid transporter, y+ system), member 1. 
SLC7A1 has LocusID: 6541, and is located on chromosome 13 with reported cytogenetic 
location 13ql2-ql4. KIAA0774 encodes KIAA0774 protein, and has LocusID: 23281. 
KIAA0774 is located on chromosome 13 with reported cytogenetic location 13ql2.2. Both 
SLC7A1 and KIAA0774 have genomic locus NT_009799 (NCBI Genome Annotation). 
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CPS 29 is located 3' to the protein-coding regions of both genes. CPS 29 matches with the 
protein-coding strand of SLC7A1. SLC7A1 gene product can has strong similarity to 
murine Rec-1 (Atrcl), and can transport arginine, lysine and ornithine across the plasma 
membrane. 

[0085] CPS 30 corresponds to UBE2L6 which encodes ubiquitin-conjugating 

enzyme E2L 6. This gene has LocusID: 9246, and is located on chromosome 11 with 
reported cytogenetic location 1 lql2. The gene resides in genomic locus NT_033903 (NCBI 
Genome Annotation). The gene product is a member of the ubiquitin-conjugating enzyme 
family, and can ubiquitinate cellular proteins and mark them for degradation. The gene 
product can also bind to Hect domains of E3 proteins. 

[0086] CPS 31 corresponds to SLC11A1 which encodes solute carrier family 11 

(proton-coupled divalent metal ion transporters), member 1. This gene has LocusID: 6556, 
and is located on chromosome 2 with reported cytogenetic location 2q35. The gene resides 
in genomic locus NT_005403 (NCBI Genome Annotation). The gene product is similar to 
murine Beg (Nrampl), and may control antimicrobial activity of macrophages. 
[0087] CPS 32 corresponds to CDKN1C which encodes cyclin-dependent kinase 

inhibitor 1C (p57, Kip2). This gene has LocusID: 1028, and is located on chromosome 11 
with reported cytogenetic location 1 lpl5.5. The gene resides in genomic locus NT_009368 
(NCBI Genome Annotation). Cyclin-dependent kinase inhibitor 1C is a tight-binding 
inhibitor of several Gl cyclin/Cdk complexes and a negative regulator of cell proliferation. 
Mutations of CDKN1C are implicated in sporadic cancers and Beckwith-Wiedemann 
syndrome, suggesting that CDKN1 C may be a tumor suppressor candidate. 
[0088] CPS 32 also has 98% sequence identity with a chromosomal region near 

LOC256784. LOC256784 encodes a protein similar to cyclin-dependent kinase inhibitor 
1C (Cyclin-dependent kinase inhibitor P57) (P57KIP2), and is located on chromosome 11 
with genomic locus NT_009368 (NCBI Genome Annotation). 

[0089] CPS 33 corresponds to GAIP (RGS19) which encodes regulator of G-protein 

signaling 19. This gene has LocusID: 10287, and is located on chromosome 20 with 
reported cytogenetic location 20ql3.3. The gene resides in genomic locus NTJ)11333 
(NCBI Genome Annotation). G proteins mediate a number of cellular processes. The 
protein encoded by this gene belongs to the RGS (regulators of G-protein signaling) family 
and can interact with G protein, GAI3. G-protein signaling 19 is a guanosine 
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triphosphatase-activating protein that may function to down-regulate Galpha i/Galpha q- 
linked signaling. 

[0090] CPS 34 corresponds to TACC2 which encodes transforming, acidic coiled- 

coil containing protein 2. This gene has LocusID: 10579, and is located on chromosome 10 
with reported cytogenetic location 10q26. The gene resides in genomic locus NT_030764 
(NCBI Genome Annotation). Transforming acidic coiled-coil proteins are a conserved 
family of centrosome- and microtubule-interacting proteins that are implicated in cancer. 
The member encoded by TACC2 gene concentrates at centrosomes throughout the cell 
cycle, and it is a candidate breast tumor suppressor and biomarker for tumor progression. 
[0091] CPS 35 corresponds to CLNS1A which encodes chloride channel, 

nucleotide-sensitive, 1A. This gene has LocusID: 1207, and is located on chromosome 11 
with' reported cytogenetic location Ilql3.5-ql4. The gene resides in genomic locus 
NT_033927 (NCBI Genome Annotation). The gene product is associated with a swelling- 
induced chloride channel, and may be involved in aqueous humor formation in the eye 
[0092] CPS 35 also has 88% sequence identity with an intron sequence of TDRKH. 

TDRKH encodes tudor and KH domain-containing protein, and has LocusID: 1 1022. It is 
located at chromosome lq2L In addition, nucleotides 66-459 of CPS 35 have 89% 
sequence identity with a region near LOC221349. LOC221349 encodes a protein similar to 
chloride conductance regulatory protein ICln <I(Cln)) (Chloride channel, nucleotide 
sensitive 1A) (Chloride ion current inducer protein) (C1CI) (Reticulocyte PICIn), and has 
reported cytogenetic location 6pll.2. Nucleotides 1-66 of CPS 35 aligns with LOC152922 
with 98% sequence identity. LOCI 52922 encodes a protein similar to chloride conductance 
regulatory protein ICln (I(Cln)) (Chloride channel, nucleotide sensitive 1A) (Chloride ion 
current inducer protein) (C1CI) (Reticulocyte PICIn), and is located at chromosome 4q32.3. 
[0093] CPS 36 corresponds to DPYSL2 which encodes dmydropyrimidinase-like 2. 

This gene has LocusID: 1808, and is located on chromosome 8 with reported cytogenetic 
location 8p22-p21. The gene resides in genomic locus NT023666 (NCBI Genome 
Annotation). The gene product is a member of the dihydropyrimidinase family. 
[0094] CPS 37 corresponds to FCGR3A which encodes Fc fragment of IgG, low 

affinity EQa, receptor for (CD16). This gene has LocusID: 2214, and is located on 
chromosome 1 with reported cytogenetic location lq23. The gene resides in genomic locus 
NT_004668 (NCBI Genome Annotation). The gene product is type IE Fc gamma receptor, 
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and can associate with zeta chain of the T-cell receptor complex (CD3Z). The gene product 
is a member of the immunoglobulin superfamily 

[0095] CPS 38 corresponds to UQCRC1 which encodes ubiquinol-cytochrome c 

reductase core protein I. This gene has LocusID: 7384, and is located on chromosome 3 
with reported cytogenetic location 3p21.3. The gene resides in genomic locus NT_005990 
(NCBI Genome Annotation). The gene product, core I protein, is a subunit of the 
ubiquinol-cytochrome c oxidoreductase in the mitochondrial respiratory chain. 
[0096] CPS 39 corresponds to GZMA which encodes granzyme A (granzyme 1, 

cytotoxic T-lymphocyte-associated serine esterase 3). This gene has LocusID: 3001, and is 
located on chromosome 5 with reported cytogenetic location 5ql l-ql2. The gene resides in 
genomic locus NT_006431 (NCBI Genome Annotation). Cytolytic T lymphocytes (CTL) 
and natural killer (NK) cells share the ability to recognize, bind, and lyse specific target 
cells* They are thought to protect their host by lysing cells bearing on their surface 
"nonself * antigens, usually peptides or proteins resulting from infection by intracellular 
pathogens. GZMA gene product is a T cell- and natural killer cell-specific serine protease 
that may junction as a common component necessary for lysis of target cells by cytotoxic T 
lymphocytes and natural killer cells. 

[0097] CPS 41 corresponds to PSMB10 which encodes proteasome (prosome, 

i 

macropain) subunit, beta type, 10. This gene has LocusID: 5699, and is located on 
chromosome 16 with reported cytogenetic location 16q22.1. The gene resides in genomic 
locus NT_010478 (NCBI Genome Annotation). The gene can replace beta subunit PSMB7 
when cells are stimulated by interferon g. 

[0098] CPS 42 corresponds to IDS which encodes iduronate 2-sulfatase (Hunter 

syndrome). This gene has LocusID: 3423, and is located on chromosome X with reported 
cytogenetic location Xq28. The gene resides in genomic locus NTJH9686 (NCBI Genome 
Annotation). Iduronate-2-sulfatase is involved in the lysosomal degradation of heparan 
sulfate and dermatan sulfate. Mutations in this X-chromosome gene that result in enzymatic 
deficiency may lead to the sex-linked Mucopolysaccharidosis Type II, also known as 
Hunter Syndrome. Iduronate-2-sulfatase has a sequence homology with human 
arylsulfatases A, B, and C, and human glucosamine-6-sulfatase. A splice variant of this 
gene has been described. 
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[0099] Affymetrix annotation suggests that CPS 43 corresponds to AHNAK which 

encodes AHNAK nucleoprotein (desmoyokin) and has LocusID: 195. The gene is located 
at chromosome 1 1 ql 2-ql 3 . 

[0100] Blast search of the Entrez human genome sequence database indicates that 

CPS 43 aligns with a region 3' to LOC221087 with 100% sequence identity. LOC221087 
is hypothetical gene supported by M80899. The gene is located on chromosome llql3.1, 
and resides in genomic locus NT_033241 (NCBI Genome Annotation), 
[0101] CPS 44 corresponds to PTPN7 which encodes protein tyrosine phosphatase, 

non-receptor type 7. This gene has LocusID: 5778, and is located on chromosome 1 with 
reported cytogenetic location lq32.1. The gene resides in genomic locus NTJ)34408 
(NCBI Genome Annotation). The gene product is a member of the protein tyrosine 
phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a 
variety of cellular processes including cell growth, differentiation, mitotic cycle, and 
oncogenic transformation. PTPN7 gene is expressed in a variety of hematopoietic cells, and 
is an early response gene in lymphokine stimulated cells. The noncatalytic N-terminus of 
PTPN7 gene product can interact with MAP kinases and suppress the MAP kinase 
activities. The gene product is also shown to be involved in the regulation of T cell antigen 
receptor (TCR) signaling, which is thought to function through dephosphorylating the 
molecules related to MAP kinase pathway. At least three alternatively spliced transcript 
variants of this gene, which encode at least two distinct isoforms, have been reported. 
[0102] CPS 45 corresponds to DUSP2 which encodes dual specificity phosphatase 

2. This gene has LocusID: 1844, and is located on chromosome 2 with reported cytogenetic 
location 2qll. The gene resides in genomic locus NT_026970 (NCBI Genome 
Annotation). The protein encoded by DUSP2 is a member of the dual specificity protein 
phosphatase subfamily. These phosphatases can inactivate their target kinases by 
dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They 
negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily 
(MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and 
differentiation. Different members of the family of dual specificity phosphatases show 
distinct substrate specificities for various MAP kinases, different tissue distribution and 
subcellular localization, and different modes of inducibility of their expression by 
extracellular stimuli. DUSP2 gene product can inactivate ERK1 and ERK2, is expressed in 
hematopoietic tissues, and is localized in the nucleus. 

57 



WO 2004/072265 



PCTAJS2004/004118 



[0103] Aflymetrix annotation suggests that CPS 46 corresponds to PSMB9 which 

encodes proteasome (prosome, macropain) subunit, beta type, 9 (large multifunctional 
protease 2). PSMB9 has LocusID: 5698, and is located at chromosome 6p21.3. 
[0104] Blast search of the Entrez human genome sequence database indicates that 

CPS 46 shows 100% sequence identity with three regions near PSMB8. These regions are 
within genomic locus NT_007592 (NCBI Genome Annotation). PSMB8 encodes 
proteasome (prosome, macropain) subunit, beta tyjje, 8 (large multifunctional protease 7). 
It has LocusID: 5696 and reported cytogenetic location 6p2L3. 

[0105] Blast search of the Entrez human genome sequence database shows that CPS 

47 overlaps or includes an intron sequence of HANI 1 . HANI 1 encodes WD-repeat protein, 
and has LocusID: 10238 with reported cytogenetic location 17q2L33. HAN11 resides in 
genomic locus NT_035428 (NCBI Genome Annotation). HAN11 gene product has WD- 
repeats, and is similar to plant anl 1 . 

[0106] Blast search of the Entrez human genome sequence database also shows that 

CPS 48 overlaps or includes LILRB1. LILRB1 encodes leukocyte immunoglobulin-like 
receptor, subfamily B (with TM and ITIM domains), member 1. LILRB1 has LocusID: 
10859, and is located on chromosome 19 with reported cytogenetic location 19ql 3.4. 
LILRB1 resides in genomic locus NT_011225 (NCBI Genome Annotation), The gene 
product (leukocyte immunoglobulin-like receptor Bl) contains immunoreceptor tyrosine- 
based inhibitory motifs, and can bind to cellular and viral MHC class I antigens. 
[0107] CPS 48 also aligns with LILRB2 with 91% sequence identity. LILRB2 

encodes leukocyte immunoglobulin-like receptor, subfamily B (with TM and ITIM 
domains), member 2. The gene has LocusID: 10288 and reported cytogenetic location 
19ql3.4. 

[0108] CPS 49 corresponds to SDR1 which encodes short-chain 

dehydrogenase/reductase 1. This gene has LocusID: 9249, and is located on chromosome 1 
with reported cytogenetic location lp36.1. The gene resides in genomic locus NT_022041 
(NCBI Genome Annotation). Short-chain dehydrogenase/reductase 1 can reduce all-trans- 
retinal during bleached visual pigment regeneration. 

[0109J Aflymetrix annotation suggests that CPS 50 corresponds to POLR2L which 

encodes polymerase (RNA) H (DNA directed) polypeptide L, 7.6kDa. POLR2L has 
LocusID: 5441, and is located at chromosome llp!5. 
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[0110] CPS 51 corresponds to SCYA2 which encodes small inducible cytokine A2 

(monocyte chemotactic protein 1). This gene has LocusID: 6347, and is located on 
chromosome 17 with reported cytogenetic location 17qll.2-q21.1. The gene resides in 
genomic locus NT_010799 (NCBI Genome Annotation). Cytokine A 2 is a chemotactic 
factor for monocytes. 

[0111] CPS 52 corresponds to PAI2 (SERPINB2) which encodes serine (or 

cysteine) proteinase inhibitor, clade B (ovalbumin), member 2. This gene has LocusID: 
5055, and is located on chromosome 18 with reported cytogenetic location 18q21.3. The 
gene resides in genomic locus NT_033907 (NCBI Genome Annotation). The gene product 
is also known as plasminogen activator inhibitor II, and may function as a serine protease 
inhibitor. It is considered a member of the seipin family of serine protease inhibitors. 
[0112] Blast search of the Entrez human genome sequence database shows that CPS 

53 overlaps or includes LOC158972. LOC158972 has reported cytogenetic location Xq28, 
and is located in genomic locus NT_026504 (NCBI Genome Annotation). 

[0113] CPS 53 also aligns with a chromosomal region near LOC93052 with 93% 

sequence identity. LOC93052 has reported cytogenetic location Xq28. 

[0114] Blast search of the Entrez human genome sequence database shows that CPS 

54 overlaps or includes KIAA1564. KIAA1564 encodes KIAA1 564 protein, and has 
LocusID: 57680 with reported cytogenetic location 14qll.l. KIAA1564 resides in genomic 
locus NT_025892 (NCBI Genome Annotation). CPS 54 matches with the non-protein- 
coding strand of KIAA1564. 

[0115] Asymetrix annotation indicates that CPS 55 corresponds to JMJ. 

Nucleotides 1997 to 198161 of AL021938 (SEQ ID NO:52) have 99% sequence identity 
with JMJ. JMJ encodes jumonji homolog (mouse), and has LocusID: 3720. The gene is 
located on chromosome 6 with reported cytogenetic location 6p24-p23 and genomic locus 
NT_007592 (NCBI Genome Annotation). JMJ is an ortholog of the mouse jumonji gene, 
which encodes a nuclear protein involved in mouse embryogenesis, including neural tube 
formation. Overexpression of mouse jumonji negatively regulates cell proliferation. The 
jumonji proteins contain a DNA-binding domain, called an AT-rich interaction domain 
(ARID), and share regions of similarity with human retinoblastoma-binding protein-2 and 
the human SMCX protein. Nucleotides 1997 to 198161 of AL021938 match with the non- 
protein-coding strand of JMJ. 
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[0116] Nucleotides 1 to 66 of CPS 55 have 100% sequence identity with a 

chromosomal region in genomic locus NT_010463 (NCBI Genome Annotation). This 
chromosomal region is located 3 5 to LOC222499 which has reported cytogenetic location 
16q21. 

[0117] CPS 56 corresponds to UGP2 which encodes UDP-glucose 

pyrophosphorylase 2. This gene has LocusID: 7360, and is located on chromosome 2 with 
reported cytogenetic location 2pl4-pl3. The gene resides in genomic locus NT_005375 
(NCBI Genome Annotation). The enzyme encoded by UGP2 is an intermediary in 
mammalian carbohydrate interconversions. It can transfer a glucose moiety from glucose- 
1 -phosphate to MgUTP and form UDP-glucose and MgPPi. 

[0118] Nucleotides 155 to 433 of CPS .56 have 96% sequence identity with 

LOC253415. LOC253415 encodes a protein similar to UDP-glucose pyrophosphorylase 2 
(UTP-glucose-1 -phosphate uridyltransferase) (UDP-glucose diphosphorylase) (UGPase 2). 
LOC25341 5 is located on chromosome 2. 

[0119] CPS 57 corresponds to KIAA0410. This gene has LocusID: 9818, and is 

located on chromosome 13 with reported cytogenetic location 13ql2.12. The gene resides 
in genomic locus NT_009799 (NCBI Genome Annotation). 

[0120] CPS 58 corresponds to NDUFS7 which encodes NADH dehydrogenase 

(ubiquinone) Fe-S proteiii 7 (20kD) (NADH-coenzyme Q reductase). This gene has 
LocusID: 4727, and is located on chromosome 19 with reported cytogenetic location 19pl3. 
The gene resides in genomic locus NT_01 1268 (NCBI Genome Annotation). 
[0121] CPS 59 corresponds to KIAA0645. This gene has LocusID: 9681, and is 

located on chromosome 22 with reported cytogenetic location 22ql2.3. The gene resides in 
genomic locus NTJ01 1520 (NCBI Genome Annotation). 

[0122] CPS 60 corresponds to GSTP1 which encodes glutathione S-transferase pi. 

This gene has LocusID: 2950, and is located on chromosome 11 with reported cytogenetic 
location llql3. The gene resides in genomic locus NT_033241 (NCBI Genome 
Annotation). Glutathione S-transferases (GSTs) are a family of enzymes that play a role in 
detoxification by catalyzing the conjugation of many hydrophobic and electrophilic 
compounds with reduced glutathione. The soluble GSTs are categorized into 4 main 
classes: alpha, mu, pi, and theta. The glutathione S-transferase pi gene (GSTP1) is a 
polymorphic gene encoding active, functionally different GSTP1 variant proteins that are 
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thought to function in xenobiotic metabolism and play a role in susceptibility to cancer, and 
other diseases. 

[0123] Nucleotides 180 to 558 have 86% sequence identity with a chromosomal 

region near DGKA. DGK encodes diacylglycerol kinase, alpha (80kD), and is located at 
chromosome 12ql3.3 with LocusID: 1606. 

[0124] CPS 61 corresponds to DECR1 which encodes 2,4-dienoyl CoA reductase 1, 

mitochondrial. This gene has LocusID: 1666, and is located on chromosome 8 with 
reported cytogenetic location 8q21.3. The gene resides in genomic locus NT_034898 
(NCBI Genome Annotation). The gene product is an accessory enzyme which participates 
in the beta-oxidation and metabolism of unsaturated fatty enoyl-CoA esters. 
[0125] CPS 62 corresponds to PLXNC1 which encodes plexin CI. This gene has 

LocusID: 10154, and is located on chromosome 12 with reported cytogenetic location 
12q23.3. The gene resides in genomic locus NT_009575 (NCBI Genome Annotation). 
Plexin CI can function as a receptor for virally-encoded semphorin. It is a member of the 
plexin family. 

[0126] CPS 63 corresponds to TUBA2 which encodes tubulin, alpha 2. This gene 

has LocusID: 7278, and is located on chromosome 13 with reported cytogenetic location 
13qll. The gene resides in genomic locus NTJ)09799 (NCBI Genome Annotation). 
Microtubules of the eukaryotic cytoskeleton perform essential and diverse functions and are 
composed of a heterodimer of alpha and beta tubulin. The genes encoding these 
microtubule constituents are part of the tubulin superfamily, which is composed of six 
distinct families. Genes from the alpha, beta and gamma tubulin families are found in all 
eukaryotes. The alpha and beta tubulins represent the major components of microtubules, 
while gamma tubulin plays a critical role in the nucleation of microtubule assembly. There 
are multiple alpha and beta tubulin genes and they are conserved among and between 
species. TUBA2 gene is an alpha tubulin gene that encodes a protein similar to the mouse 
testis-specific Tuba3 and Tuba7 gene products. TUBA2 gene is located in the 13qll 
region, which is associated with the genetic diseases Clouston hidrotic ectodermal dysplasia 
and Kabuki syndrome. Alternative splicing has been observed for this gene and at least two 
variants have been identified. 

[0127] CPS 63 has 95-96% sequence identity with H2-ALPHA and LOCI 12714. 

H2-ALPHA encodes alpha-tubulin isotype H2-alpha, and has LocusID: 113457 with 
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reported cytogenetic location 2q22.1. LOC112714 encodes a protein similar to alpha 
tubulin, and has LocusID: 1 12714 with reported cytogenetic location 2ql4.2. 
[0128] In addition, CPS 63 shows 85-90% sequence identity with a chromosomal 

region near MGC16703. MGC16703 encodes alpha tubulin-like, and is located at 
chromosome 22qll,21 with LocusID: 11369L Fragments of CPS have 83-91% sequence 
identity with regions on chromosomes 1 and 22. 

[0129] CPS 64 corresponds to NME1 which encodes non-metastatic cells 1, protein 

(NM23A) expressed in. This gene has LocusID: 4830, and is located on chromosome 17 
with reported cytogenetic location 17q21.3. The gene resides in genomic locus NT_0 10783 
(NCBI Genome Annotation). NME1 was reported to have reduced mRNA transcript levels 
in highly metastatic cells. NME1 encodes the "A" isoform of nucleoside diphosphate 
kinase (NDK). NDK exists as a hexamer composed of the "A" (encoded by NME1) and 
"B" (encoded by NME2) isoforms. Mutations in NME1 have been identified in aggressive 
neuroblastomas. NME1 gene product may have a role in the transcriptional regulation of c- 
myc expression. 

[0130] Blast search of the Entrez human genome sequence database shows that CPS 

65 overlaps or includes a chromosomal region between SLC1 1 Al and NLI-IF. Both genes 
are located on chromosome 2 with reported cytogenetic location 2q35 and genomic locus 
NTJJ05403 (NCBI Genome Annotation). SLC11A1 encodes solute carrier family 11 
(proton-coupled divalent metal ion transporters), member 1, and has LocusID: 6556. 
SLC11A1 gene product is similar to murine Beg (Nrampl), and may control antimicrobial 
activity of macrophages. NLI-IF encodes nuclear LIM interactor-interacting factor, and has 
LocusID: 58190. NLI-IF gene product is similar to a region of S. cerevisiae plasma 
membrane phosphatase Psr2p. CPS 65 is located 3 V to the protein-coding sequence of 
SLC1 1A1 and 5' to the protein-coding sequence of NLI-IF. 

[0131] Asymetrix annotation suggests that CPS 66 corresponds to RNAHP which 

encodes RNA helicase-related protein and has LocusID: 11325. The gene has reported 
cytogenetic location at chromosome 17q22. 

[0132] Blast search of the Entrez human genome sequence database shows that CPS 

66 aligns with a chromosomal region at chromosome 16ql3 with at least 95% sequence 
identity. This region resides in genomic locus NT_010498 (NCBI Genome Annotation), 
and is located near MT1G. MT1G encodes metallothionein 1G, and has LocusID: 4495. 
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[0133] In addition, CPS 66 has 83-92% sequence identity with various regions on 

chromosomes 1, 4, 9, 16 and 20. These regions include MT1H, MT2P1, LOC255917, 
LOC127544, LOC149450, a region near MGC10731, and a region near MMP24. MT1H 
encodes metallothionein 1H, and has LocusID: 4496 with reported cytogenetic location 
16ql3. MT2P1 encodes metallothionein 2 pseudogene 1 (processed), and is located at 
chromosome 4pll-q21 (LocusID: 4503). MGC10731 has LocusID: 79363, and is located 
at chromosome lp36.13. LOC255917 encodes a protein similar to Metallothionein-IE (MT- 
1E), and is located on chromosome 9. LOCI 27544 encodes a protein similar to 
(1J1174N9.1 (novel protein with IBR domain), and is located at chromosome lp34.3. 
LOCI 49450 has reported cytogenetic location lq42.3. MMP24 encodes matrix 
metalloproteinase 24 (membrane-inserted), and is located at chromosome 20qll.2 with 
LocusID: 10893, 

[0134] CPS 67 corresponds to ZNF198 which encodes zinc finger protein 198. This 

gene has LocusID: 7750, and is located on chromosome 13 with reported cytogenetic 
location 13qll-ql2. . The gene resides in genomic locus NT_009799 (NCBI Genome 
Annotation). Zinc-finger protein 198 contains zinc fingers. 

[0135] Nucleotides 185-221 of CPS 67 have 100% sequence identity with an intron 

sequence of LOC205936. LOC205936 is located at chromosome 4pl6.2, and resides in 
genomic locus NT_006051 (NCBI Genome Annotation) 

[0136] CPS 68 corresponds to PRCP which encodes prolylcarboxypeptidase 

(angiotensinase C). This gene has LocusID: 5547, and is located on chromosome 11 with 
reported cytogenetic location 1 lql4. The gene resides in genomic locus NT_033927 (NCBI 
Genome Annotation). Prolylcarboxypeptidase (iangiotensinase C) is a serine 
carboxypeptidase and can remove residues linked to proline. 

[0137] CPS 70 corresponds to DKFZP586A0522 which encodes DKFZP586A0522 

protein. This gene has LocusID: 25840, and is located on chromosome 12 with reported 
cytogenetic location 12qll. The gene resides in genomic locus NT_009782 (NCBI 
Genome Annotation). The gene product include a region with low sequence similarity to a 
region of S. cerevisiae Coq5p. DKFZP586A0522 overlaps with LOC196529 which is 
encodes a protein similar to DKJFZP586A0522 protein. 

[0138] CPS 71 corresponds to SFRS11 which encodes splicing factor, 

arginine/serine-rich 11. This gene has LocusID: 9295, and is located on chromosome 1 
with reported cytogenetic location Ip21-p34. The gene resides in genomic locus 
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NT_004464 (NCBI Genome Annotation). The gene product contains arginine/serine-rich 
domain and an RRM domain, and may have a role in pre-mRNA splicing. 
[0139] Nucleotides 1 to 234 of CPS 71 have 89% sequence identity with an intron 

sequence of PEPP2. PEPP2 encodes phosphoinositol 3-phosphate-binding protein-2, and 
has LocusID: 54477 with reported cytogenetic location 12pl2. 

[0140] CPS 72 corresponds to RAC2 which encodes ras-related C3 botulinum toxin 

substrate 2 (rho family, small GTP binding proteiii Rac2). This gene has LocusID: 5880, 
and is located on chromosome 22 with reported cytogenetic location 22ql3.1. The gene 
resides in genomic locus NTJ)11520 (NCBI Genome Annotation). The protein encoded by 
RAC2 is a GTPase which belongs to the RAS superfamily of small GTP-binding proteins. 
Members of this superfamily appear to regulate a diverse array of cellular events, including 
the control of cell growth, cytoskeletal reorganization, and the activation of protein kinases. 
The RAC2 gene product may be a target for ADP ribosylation by the C3 subunit of 
botulinum toxin. 

[0141] Nucleotides 400 to 487 of CPS 72 have 95% sequence identity with 

STK17A. STK17A encodes serine/threonine kinase 17a (apoptosis-inducing), and has 
LocusID: 9263 with reported cytogenetic location 7pl2-pl4. 

[0142] CPS 73 corresponds to ZFP103 which encodes zinc finger protein 103 

homolog (mouse). This gene has LocusID: 7844, and is located on chromosome 2 with 
reported cytogenetic location 2pl 1.2. The gene resides in genomic locus NT_0158O5 NCBI 
Genome Annotation). The gene product contains a zinc-finger domain, and may be 
associated with membranous protein sorting. 

[0143] CPS 74 corresponds to LOC51580 which encodes H-2K binding factor-2. 

This gene has LocusID: 51580, and is located on chromosome 9. The gene resides in 
genomic locus NT_006316 (NCBI Genome Annotation). The gene product is a member of 
the recombination signal-sequence binding protein family. It is a transcription factor that 
binds to the NFkB site of MHC class I genes. 

[0144] CPS 75 corresponds to NRD1 which encodes nardilysin (N-arginine dibasic 

convertase). This gene has LocusID: 4898, and is located on chromosome 1 with reported 
cytogenetic location Ip32.2-p32,l. The gene resides in genomic locus NT_004424 (NCBI 
Genome Annotation). N-arginine dibasic convertase (NRD convertase) is a zinc-dependent 
endopeptidase. It is a member of the insulinase family. 
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[0145] Blast search of the Entrez human genome sequence database shows that CPS 

76 overlaps or includes the 3' untranslated region of FLJ20154. FLJ20154 encodes 
hypothetical protein FLJ20154, and is located at chromosome 10q24.32 with LocusID: 
54838. FLJ20154 resides in genomic locus NT_030059 (NCBI Genome Annotation). 
[0146] CPS 76 also has 85-88% sequence identity with an intron sequence of 
K1AA0103, a chromosome 13 region near LOCI 60822, and a chromosome 7 region near 
BAP29. KIAA0103 has LocusID: 9694 and reported cytogenetic location 8q23.1. 
LOC160822 encodes a protein similar to hypothetical protein FLJ12457, and is located at 
chromosome 13q31.1. BAP29 encodes B-cell receptor-associated protein BAP29, and is 
located at chromosome 7q22.2 with LocusID: 55973. Moreover, fragments of CPS 76 align 
with various other regions on chromosome 1, 6, 15 and 19 with 89-90% sequence identity. 
[0147] Afrymetrix annotation suggests that CPS 77 corresponds to KIAA0906, also 
known as NUP210 which encodes nucleoporin 210. The gene has LocusID: 23225, and is 
located at chromosome 3p25.1. 

[0148] Blast search of the Entrez human genome sequence database shows that CPS 

77 aligns with FLJ22389 with at least 99% sequence identity. FLJ22389 has LocusID: 
79985, and is located on chromosome 3 with reported cytogenetic location 3p25.1. The 
gene resides in genomic locus NT_005927 (NCBI Genome Annotation). 

[0149] Afrymetrix annotation suggests that CPS 78 corresponds to MT1F which 

encodes metallothibnein IF (functional) and has LocusID: 4494. The gene is located at 
16ql3. 

[0150] Blast search of the Entrez human genome sequence database shows that CPS 

78 has 100% sequence identity with a chromosomal region located 3' to the protein-coding 
sequence of MT1G. This chromosomal region, as well as MT1G, are within genomic locus 
NT_010498 (NCBI Genome Annotation) on chromosome 16. MT1G encodes 
metallothionein 1G, and has LocusID: 4495 with reported cytogenetic location 16ql3. 
[0151] Nucleotides 1 to 67 of CPS 78 have 90-95% sequence identity to various 
regions in genomic locus NT_010498. These regions include MT2A, LOC221228, MT1G 
and MT1L. MT2A encodes metallothionein 2A, and has LocusID: 4502 with reported 
cytogenetic location 16ql3. LOC221228 is a hypothetical gene supported by AF495759. 
MT1G encodes metallothionein 1G. MT1L encodes metallothionein 1L, and has LocusID: 
4500 with reported cytogenetic location 16ql3. 
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[0152] In addition, CPS 78, and certain fragments thereof, have 89-93% sequence 

identity with MT2P1, LOCI 27544, LOC255917, a chromosomal region near MMP24, a 
chromosomal region hear MGC10731, and a chromosomal region near LOC149450. 
MT2P1 encodes metallothionein 2 pseudogene 1 (processed), and has LocusID: 4503 with 
reported cytogenetic location 4pll-q21. LOC127544 encodes a protein similar to 
dJ1174N9.1 (novel protein with IBR domain), and is located at chromosome lp34.3. 
LOC255917 encodes a protein similar to metallothionein-IE (MT-1E), and is located on 
chromosome 9. MMP24 encodes matrix metalloproteinase 24 (membrane-inserted), and is 
located at chromosome 20qll.2 with LocusID: 10893. MGC10731 encodes hypothetical 
protein MGC10731, and has LocusID: 79363 with reported cytogenetic location lp36.13. 
LOC149450 is located at chromosome lq42.3. 

[0153] CPS 79 corresponds to P85SPR (ARHGEF7) which encodes Rho guanine 

nucleotide exchange factor (GEF) 7. This gene has LocusID: 8874, and is located on 
chromosome 13 with reported cytogenetic location 13q34. The gene resides in genomic 
locus NT 009952 (NCBI Genome Annotation). Rho GTPases are involved in numerous 
cellular processes that are initiated by extracellular stimuli that work through G protein 
coupled receptors. The protein encoded by P85SPR belongs to a family of cytoplasmic 
proteins that activate the Ras-like family of Rho proteins by exchanging bound GDP for 
GTP. The encoded protein may form a complex with G proteins and stimulate Rho- 
dependent signals. The protein can induce membrane ruffling. The gene product is also 
involved in Pak recruitment to Cdc42- and Racl-driven focal complexes. Multiple 
alternatively spliced transcript variants encoding different isoforms have been described for 
this gene. 

[0154] CPS 80 corresponds to UNK_N53547 (MGC5508) which encodes 

hypothetical protein MGC5508. This gene has LocusID: 79073, and is located on 
chromosome 11 with reported cytogenetic location 1 lql3.1. The gene resides in genomic 
locus NT_033903 (NCBI Genome Annotation). 

[0155] CPS 81 corresponds to GPRK5 which encodes G protein-coupled receptor 

kinase 5. This gene has LocusID: 2869, and is located on chromosome 10 with reported 
cytogenetic location 10q24-qter. The gene resides in genomic locus NT_008902 (NCBI 
Genome Annotation). G protein-coupled receptor kinases (GRKs) play a role in 
phosphorylating and regulating the activity of a variety of G protein-coupled receptors. G 
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protein-coupled receptor kinase 5 can phosphorylate agonist-stimulated G protein-coupled 
receptors. 

[0156] CPS 83 corresponds to SCYA7 which encodes small inducihle cytokine A7 

(monocyte chemotactic protein 3). This gene has LocusID: 6354, and is located on 
chromosome 17 with reported cytogenetic location 17qll.2-ql2. The gene resides in 
genomic locus NT_010799 (NCBI Genome Annotation). Monocyte chemotactic protein 3 
is a secreted chemokine which attracts macrophages, during inflammation and metastasis. It 
is a member of the C-C subfamily of chemokines 'which are characterized by having two 
adjacent cysteine residues. The protein is an in vivo substrate of matrix metalloproteinase 
2, an enzyme which degrades components of the extracellular matrix. SCYA7 gene is part 
of a cluster of C-C chemokine family members on chromosome 17q. 
[0157] CPS 84 corresponds to TUB A3 which encodes tubulin, alpha 3. This gene 

has LocusID: 7846, and is located on chromosome 12 with reported cytogenetic location 
12ql2-12ql4.3. The gene resides in genomic locus NT_009526 (NCBI Genome 
Annotation). There are multiple alpha and beta tubulin genes, which are conserved among 
species. TUB A3 encodes alpha tubulin and is similar to mouse and rat Tubal gene. 
Northern blotting studies have shown the gene expression in morphologically differentiated 
neurologic cells. TUBA3 is one of three alpha-tubulin genes in a cluster on chromosome 
12q. 

[0158] CPS 84 also has 97% sequence identity with a chromosomal region located 

3' to LOC134262. LOC134262 encodes a protein similar to alphaTub84B gene product. 

The gene is located at 5pl 1, and resides in genomic locus NTJ)23098. 

[0159] CPS 85 corresponds to SCML2 which encodes sex comb on midleg-like 2 

(Drosophila). This gene has LocusID: 10389, and is located on chromosome X with 

reported cytogenetic location Xp22. The gene resides in genomic locus NT_01 1586 (NCBI 

Genome Annotation). The gene product is similar to Drosophila Scm. 

[0160] CPS 85 also aligns with SCML1 with 94% sequence identity. SCML1 

encodes sex comb on midleg-like 1 (Drosophila). The gene has LocusID: 6322, and is 

located on chromosome X with reported cytogenetic location Xp22.2-p22.1. The gene 

resides in genomic locus NT_01 1586. 

[0161] CPS 87 corresponds to IL1R1 which encodes interleukin 1 receptor, type I. 

This gene has LocusID: 3554, and is located on chromosome 2 with reported cytogenetic 
location 2ql2. The gene resides in genomic locus NT_G22171 (NCBI Genome 
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Annotation). Type I interleukin-1 receptor contains immmunoglobulin domains, and can 
bind to all three forms of interleukin-1 (ILIA, IL1B, and IL1RN). 

[0162] CPS 88 corresponds to UNK_AL008729 (LOC221692) which encodes a 

protein similar to KIAA1733 protein. This gene is located on chromosome 6 with reported 
cytogenetic location 6p22.3. The gene resides in genomic locus NTJ)07592 (NCBI 
Genome Annotation). LOC221692 is located within LOC51256 which has LocusID: 
51256. CPS 88 matches with the non-protein-coding strand of LOC51256. 
[0163] CPS 89 corresponds to KIAA0191 which encodes KIAA0191 protein. This 

gene has LocusID: 23318, and is located on chromosome 1 with reported cytogenetic 
location lp32.3. The gene resides in genomic locus NT_004424 (NCBI Genome 
Annotation). 

[0164] CPS 90 corresponds to EGFL5 which encodes EGF-like-domain, multiple 5. 

This gene has LocusID: 1955, and is located on chromosome 9 with reported cytogenetic 
location 9q32-q33.3. The gene resides in genomic locus NT_017568 (NCBI Genome 
Annotation). 

[0165] CPS 91 corresponds to DUSP1 which encodes dual specificity phosphatase 

1. This gene has LocusID: 1843, and is located on chromosome 5 with reported cytogenetic 
location 5q34. The gene resides in genomic locus NT_023132 (NCBI Genome 
Annotation). The expression of DUSP1 gene can be induced in human skin fibroblasts by 
oxidative/heat stress and growth factors. The bacterially expressed and purified DUSP1 
protein has intrinsic phosphatase activity, and can inactivate mitogen-activated protein 
(MAP) kinase in vitro by the concomitant dephosphorylation of both its phosphothreonine 
and phosphotyrosine residues. DUSP1 protein can also suppress the activation of MAP 
kinase by oncogenic ras in extracts of Xenopus oocytes, thus, DUSP1 may play a role in 
the human cellular response to environmental stress as well as in the negative regulation of 
cellular proliferation. 

[0166] CPS 92 corresponds to FBP1 which encodes fructose- 1,6-bisphosphatase 1. 

This gene has LocusID: 2203, and is located on chromosome 9 with reported cytogenetic 
location 9q22.3. The gene resides in genomic locus NT_008476 (NCBI Genome 
Annotation). Fructose-l,6-bisphosphatase 1 is a gluconeogenesis regulatory enzyme. It can 
catalyze the hydrolysis of fructose 1,6-bisphosphate to fructose 6-phosphate and inorganic 
phosphate. Fructose- 1,6-diphosphatase deficiency is associated with hypoglycemia and 
metabolic acidosis. 
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[0167] CPS 93 corresponds to HRB which encodes HIV- 1 Rev binding protein. 

This gene has LocusID: 3267, and is located on chromosome 2 with reported cytqgenetic 
location 2q36. The gene resides in genomic locus NT_005403 (NCBI Genome 
Annotation). HIV^l Rev binding protein can interact with the activation domains of the 
HIV-1 Rev protein, and may be related to nucleoporins, a class of proteins that mediate 
nucleocytoplasmic transport. HIV-1 Rev binding protein binds to the Rev activation 
domain when Rev is assembled onto its RNA target and can significantly enhance Rev 
activity when over-expressed. The HRB gene can expressed as a major 2.8-kb and a minor 
4.6-kbmRNA. 

[0168] CPS 94 corresponds to NAGA which encodes N-acetylgalactosaminidase, 

alpha-. This gene has LocusID: 4668, and is located on chromosome 22 with reported 
cytogenetic location 22ql3-qter. The gene resides in genomic locus NT_0 11520 (NCBI 
Genome Annotation). The lysosomal enzyme alpha-N-acetylgalactosaminidase can cleave 
alpha-N-acetylgalactosaminyl moieties from glycoconjugates. Mutations in NAGA have 
been implicated as the cause of Schindler disease types I and II (type II also known as 
Kanzaki disease). 

[0169] CPS 95 corresponds to GLRX which encodes glutaredoxin (thioltransferase). 

This gene has LocusID: 2745, and is located on chromosome 5 with reported cytogenetic 
location 5ql4. The gene resides in genomic locus NT_023148 (NCBI Genome 
Annotation), Glutaredoxin can function as a glutathione-dependent hydrogen donor for 
ribonucleotide reductase. 

[0170] Fragments of CPS 95 also align with AF358259, a chromosomal region near 

GABRA6 and a chromosomal region near GLRXP with 86-90% sequence identity. 
AF358259 encodes glutaredoxin pseudogene 2, and has LocusID: 171418 with reported 
cytogenetic location 14q32.13-q32.2. GABRA6 encodes gamma-aminobutyric acid 
(GABA) A receptor, alpha 6. It has LocusID: 2559, and is located at chromosome 5q34. 
GLRXP encodes glutaredoxin (thioltransferase) pseudogene, and is located at chromosome 
20qll.2 with LocusID: 170522, In addition, nucleotides 1 to 29 of CPS 95 has 100% 
sequence identity with LOC257079. LOC257079 encodes a protein similar to glutaredoxin 
(Thioltransferase) (TTase), and is located on chromosome 5. 

[0171] CPS 96 corresponds to UNK_W28281 (GABARAPL1) which encodes 

GABA(A) receptor-associated protein like 1. This gene has LocusID: 23710, and is located 
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on chromosome 12 with reported cytogenetic location 12pl3.1. The gene resides in 
genomic locus NT_035207 (NCBI Genome Annotation). 

[0172] SEQ ID NO: 360, which can be used as a probe sequence for detecting the 

expression level of UNKW28281, also aligns with GABARAPL3 with about 95% 
sequence identity. GABARAPL3 encodes GABA(A) receptors associated protein like 3, It 
has LocusID: 23766, and is located on chromosome 15 with reported cytogenetic location 
15q25.1. 

[0173] CPS 97 corresponds to UNKAL096740 (UBE3B) which encodes ubiquitin 

protein ligase. This gene has LocusID: 89910, and is located on chromosome 12 with 
reported cytogenetic location 12q24.11. The gene resides in genomic locus NT_009770 
(NCBI Genome Annotation), 

[0174] CPS 98 corresponds to TBXAS1 which encodes thromboxane A synthase 1 

(platelet, cytochrome P450, subfamily V). This gene has LocusID: 6916, and is located on 
chromosome 7 with reported cytogenetic location 7q34-q35. The gene resides in genomic 
locus NT_007914 (NCBI Genome Annotation). The gene product is a member of the 
cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins are 
monooxygenases which catalyze many reactions involved in drug metabolism and synthesis 
of cholesterol, steroids and other lipids. TBXAS1 gene product is considered a member of 
the cytochrome P450 superfamily on the basis of sequence similarity rather than functional 
similarity. It is an endoplasmic reticulum membrane protein, and can catalyze the 
conversion of prostglandin H2 to thromboxane A2, a potent vasoconstrictor and inducer of 
platelet aggregation. TEXAS 1 gene product may play a role in several pathophysiological 
processes including hemostasis, cardiovascular disease, and stroke. The gene expresses at 
least two transcript variants. 

[0175] CPS 99 corresponds to DPYD which encodes dihydropyrimidine 

dehydrogenase. This gene has LocusID: 1806, and is located on chromosome 1 with 
reported cytogenetic location lp22. The gene resides in genomic locus NT_034389 (NCBI 
Genome Annotation). Dihydropyrimidine dehydrogenase is a pyrimidine catabolic enzyme 
which is involved in the initial and rate-limiting step in the pathway of uracil and thymidine 
catabolism and also in the pathway leading to the formation of beta-alanine. The DPYD 
gene is a large gene of approximately 150 kb consisting of at least 23 exons encoding a 
protein of approximately 1 1 1-kDa. Genetic deficiency of DPYD enzyme results in an error 
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in pyrimidine metabolism associated with thymine-uraciluria and an increased risk of 
toxicity in cancer patients receiving 5-flourouraciI chemotherapy. 

[0176] Affymetrix annotation suggests that CPS 100 corresponds to ECGF1 which 

encodes endothelial cell growth factor 1 (platelet-derived). Nucleotides 4 to 120 of M63193 
(SEQ ID NO: 94) align with ECGF1. This gene has LocusID: 1890, and is located on 
chromosome 22 with reported cytogenetic location 22ql3.33. The gene resides in genomic 
locus NT J)l 1 526 (NCBI Genome Annotation). , 

10177] CPS 100 aligns with SC02 with aft least 98% sequence identity. SC02 

encodes SCO cytochrome oxidase deficient homolog 2 (yeast). It has LocusID: 9997, and 
is located on chromosome 22 with reported cytogenetic location 22ql3.33. The gene 
resides in genomic locus NT 01 1526. Mammalian cytochrome c oxidase (COX) catalyzes 
the transfer of reducing equivalents from cytochrome c to molecular oxygen and pumps 
protons across the inner mitochondrial membrane. In yeast, two related COX assembly 
genes, yeast SCOl and SC02 (synthesis of cytochrome c oxidase), enable subunits 1 and 2 
to be incorporated into the holoprotein. SC02 is the human homolog of the yeast SC02 
gene. 

[0178] Affymetrix annotation suggests that CPS 101 corresponds to TIMP1 which 

encodes tissue inhibitor of metalloproteinase 1 (erythroid potentiating activity, collagenase 
inhibitor). The gene has LocusID: 7076, and is located at chromosome Xpl 1 .3-pl 1 .23. 
[0179] Blast search of the Entrez human genome sequence database shows that CPS 

101 aligns with an intron sequence of SYN1 with at least 97% sequence identity. SYN1 
encodes synapsin I, and has LocusID: 6853 with reported cytogenetic location Xpl 1.23. 
The gene resides in genomic locus NT_011568. CPS 101 matches with the non-protein- 
coding strand of SYN1 . 

[0180] CPS 102 corresponds to GSN which encodes gelsolin (amyloidosis, Finnish 

type). This gene has LocusID: 2934, and is located on chromosome 9 with reported 
cytogenetic location 9q33. The gene resides in genomic locus NTJ)17568 (NCBI Genome 
Annotation). Gelsolin is a calcium-dependent protein which may function to sever and cap 
actin filaments. 

[0181] CPS 103 corresponds to SECTM1 which encodes secreted and 

transmembrane 1. This gene has LocusID: 6398, and is located on chromosome 17 with 
reported cytogenetic location 17q25. The gene resides in genomic locus NT_02591 1 (NCBI 
Genome Annotation). The gene product is a transmembrane and secreted protein with 
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characteristics of a type la transmembrane protein. It is found in a perinuclear Golgi-like 
pattern and thought to be involved in hematopoietic or immune system processes. The gene 
product may have a role in hematopoiesis or immune processes. 

[0182] CPS 104 corresponds to OLR1 which encodes oxidised low density 

lipoprotein (lectin-like) receptor 1. This gene has LocusID: 4973, and is located on 
chromosome 12 with reported cytogenetic location 12pl3.2-pl2.3. The gene resides in 
genomic locus NTJ)35207 (NCBI Genome Annotation). Lectin-like oxidized low density 
lipoprotein receptor is a member of the C-type lectin receptor family, and may be involved 
in degradation of oxidized LDL by vascular endothelial cells. 

[0183] CPS 105 corresponds to D6S49E (LST1) which encodes leukocyte specific 

transcript 1. This gene has LocusID: 7940, and is located on chromosome 6 with reported 
cytogenetic location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI 
Genome Annotation). The gene product is expressed in leukocytes and induced by IFN- 
gamma. It possibly functions in the immune response of monocytes and T cells, 
[0184] CPS 106 corresponds to JUNB which encodes jun B proto-oncogene. This 

gene has LocusID: 3726, and is located on chromosome 19 with reported cytogenetic 
location 19pl3.2. The gene resides in genomic locus NT_011176 (NCBI Genome 
Annotation). The gene product may participate in AP-1 transcriptional activation. 
[0185] CPS 107 corresponds to PFC which encodes properdin P factor, 

complement. This gene has LocusID: 5199, and is located on chromosome X with reported 
cytogenetic location Xpl 1.3-pl 1.23. The gene resides in genomic locus NT_01 1568 (NCBI 
Genome Annotation). The gene product contains a related type-I repeat sequence, and may 
play a role in complement-mediated clearance. 

[0186] CPS 108 corresponds to POLR2E which encodes polymerase {RNA) II 

(DNA directed) polypeptide E (25kD). This gene has LocusID: 5434, and is located on 
chromosome 19 with reported cytogenetic location 19pl3.3. The gene resides in genomic 
locus NT_0 11277 (NCBI Genome Annotation). This gene encodes a subunit of RNA 
polymerase n, the polymerase responsible for synthesizing messenger RNA in eukaryotes. 
The encoded subunit is shared by the other two DNA-directed RNA polymerases and is 
present in two-fold molar excess over the other polymerase subunits. An interaction 
between this subunit and a hepatitis virus transactivating protein has been demonstrated, 
suggesting that interaction between transcriptional activators and the polymerase can occur 
through this subunit A pseudogene is located on chromosome 1 1 . 
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[0187] CPS 109 corresponds to THBS1 which encodes thrombospondin 1. This 

gene has LocusID: 7057, and is located on chromosome 15 with reported cytogenetic 
location 15ql5. The gene resides in genomic locus NT_030828 (NCBI Genome 
Annotation). Thrombospondin- 1 is a member of a family of adhesive molecules. It has a 
role in blood clotting and in angiogenesis. 

[0188] CPS 110 corresponds to HK2 which encodes hexokinase 2. This gene has 

LocusID: 3099, and is located on chromosome 2 \yith reported cytogenetic location 2pl3. 
The gene resides in genomic locus NTJJ22184 (NCBI Genome Annotation). Hexokinases 
phosphorylate glucose to produce glucose-6-phosphate, thus committing glucose to the 
glycolytic pathway. HK2 gene encodes hexokinase 2, the predominant form found in 
skeletal muscle. The gene product localizes to the outer membrane of mitochondria. 
Expression of this gene is insulin-responsive, and studies in rat suggest that it is involved in 
the increased rate of glycolysis seen in rapidly growing cancer cells. 
[0189] CPS 1 10 also aligns with a chromosomal region near LOC139132 with about 

95% sequence identity. Both the chromosomal region and LOCI 39132 reside in genomic 
locus NT_01 1597. LOC139132 encodes a protein similar to WW domain binding protein 

II (SH3 domain-binding protein SNP70) (Npw38-binding protein NpwBP), and is located 
atXql3.2. 

[0190] Asymetrix annotation suggests that CPS 111 corresponds to INSIG1 which 

encodes insulin induced gene 1. The gene has LocusID: 3638 and is located at chromosome 
7q36. 

[0191] Blast search of the Entrez human genome sequence database shows that CPS 

III has about 80-85% sequence identity with a chromosomal region near LOCI 3 1742. 
This chromosomal region and LOCI 3 1742 reside in genomic locus NT_022554. 
LOCI 3 1742 has reported cytogenetic location 3pl2.1-pl 1.2. 

[0192] CPS 112 corresponds to HCK which encodes hemopoietic cell kinase. This 

gene has LocusID: 3055, and is located on chromosome 20 with reported cytogenetic 
location 20qll-ql2. The gene resides in genomic locus NT_028392 (NCBI Genome 
Annotation). The gene product can function as a non-receptor protein tyrosine kinase. 
[0193] Affymetrix annotation suggests that CPS 113 corresponds to HP10390, alos 

known as TMEM4 which encodes transmembrane protein 4. The gene has LocusID: 10330, 
and is located at chromosome 12ql5. 
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[0194] Blast search of the Entrez human genome sequence database shows that CPS 

113 aligns with two regions with 100% sequence identity. The first region is located in an 
intron of TIMELESS. The second region is located 5' to the polypeptide-coding sequence 
of TIMELESS. TIMELESS encodes timeless homolog (Drosophila). It has LocusID: 
8914, and is located on chromosome 12 with reported cytogenetic location 12ql2-ql3. The 
gene resides in genomic locus NT_009458 (NCBI Genome Annotation). The gene product 
may be involved in circadian oscillation autoregulation. 

[0195] CPS 1 14 corresponds to UNKJJ51712 (LAGY) which encodes lung cancer- 

associated Y protein. This gene has LocusID: 84525, and is located on chromosome 4 with 
reported cytogenetic location 4qll-ql2. The gene resides in genomic locus NT_022853 
(NCBI Genome Annotation). Multiple alternatively spliced transcript variants have been 
described for this gene. 

[0196] CPS 115 corresponds to which encodes KIAA0217 which encodes 

KIAA0217 protein. This gene has LocusID: 23185, and is located on chromosome 10 with 
reported cytogenetic location 1 Op 15. 3. The gene resides in genomic locus NT_024115 
(NCBI Genome Annotation). 

[0197] CPS 116 corresponds to NCF1 which encodes neutrophil cytosolic factor 1 

(47kD, chronic granulomatous disease, autosomal 1). This gene has LocusID: 4687, and is 
located on chromosome 7 with reported cytogenetic location 7qll.23. The gene resides in 
genomic locus NT_034886 (NCBI Genome Annotation). NCF1 gene product can produce a 
burst of superoxide which is delivered to the lumen of the neutrophil phagosome. 
Mutations in NCF1, as well as in other NADPH oxidase subunits, can result in chronic 
granulomatous disease. 

[0198] CPS 116 also aligns with LOC220830 and LOC256379 with 95-99% 

sequence identity. Both genes reside in genomic locus NT_007758 on chromosome 7. 
Both genes encode proteins similar to neutrophil cytosolic factor 1 (47kD, chronic 
granulomatous disease, autosomal 1). 

[0199] CPS 117 corresponds to KIAA1113 (TRIM33) which encodes tripartite 

motif-containing 33. This gene has LocusID: 51592, and is located on chromosome 1 with 
reported cytogenetic location lpl3.L The gene resides in genomic locus NT_019273 
(NCBI Genome Annotation). The protein encoded by this gene is thought to be a 
transcriptional compressor. The protein is a member of the tripartite motif family. The 
tripartite motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box 
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type 2, and a coiled-coil region. At least three alternatively spliced transcript variants for 
this gene, have been described. 

[0200] CPS 118 corresponds to DKFZP566H073 which encodes DKFZP566H073 

protein. This gene has LocusID: 26001, and is located on chromosome 17 with reported 
cytogenetic location 17pl3.3. The gene. resides in genomic -locus NT_033299 (NCBI 
Genome Annotation). The gene product contains a C3HC4 type (RING) zinc finger, and 
can mediate protein-protein interactions. . 

[0201] CPS 119 corresponds to ATP2B1 which encodes ATPase, Ca-H- 

transporting, plasma membrane 1. This gene has LocusED: 490, and is located on 
chromosome 12 with reported cytogenetic location 12q21-q23. The gene resides in 
genomic locus NT_009729 (NCBI Genome Annotation). 

[0202] Nucleotides 305 to 338 of CPS 119 align with an intron sequence of 

FLJ14075 with 100% sequence identity. FLJ14075 encodes hypothetical protein FLJ14075, 
and has LocusID: 79954 with reported cytogenetic location 2p25.1. 

[0203] CPS 120 corresponds to IFITM2 which encodes interferon induced 

transmembrane protein 2 (1-8D). This gene has LocusID: 10581, and is located on 
chromosome 11 with reported cytogenetic location llpl5.5. The gene resides in genomic 
locus NT_009407 (NCBI Genome Annotation). The expression of this gene can be induced 
by interferon. 

[0204] CPS 121 corresponds to MNDA which encodes myeloid cell nuclear 

differentiation antigen. This gene has LocusID: 4332 , and is located on chromosome 1 
with reported cytogenetic location lq22. The gene resides in genomic locus NTJ)04982 
(NCBI Genome Annotation). The myeloid cell nuclear differentiation antigen can be 
detected in nuclei of cells of the granulocyte-monocyte lineage. A 200-amino acid region of 
the protein is similar to a region in the proteins encoded by a family of interferon-inducible 
mouse genes, designated Ifi-201, Ifi-202, and Ifi-203. The MNDA mRNA, which contains 
an interferon-stimulated response element in the 5-prime untranslated region, can be 
upregulated in human monocytes exposed to interferon alpha. MNDA is located within 
2,200 kb of FCER1A, APCS, CRP, and SPTAL In its pattern of expression or regulation, 
MNDA Tesembles IFI16, suggesting that these genes participate in blood cell-specific 
responses to interferons. 

[0205] CPS 122 corresponds to BTN3A2 which encodes butyrophilin, subfamily 3, 

member A2. This gene has LocusID: 1 1 1 1 8, and is located on chromosome 6 with reported 
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cytogenetic location 6p22.1. The gene resides in genomic locus NTJ)07592 (NCBI 
Genome Annotation). 

[0206] Nucleotides 213 to 250 of CPS 122 align with BTN3A3 and BTN3A1 with 

94% and 97% sequence identity, respectively. BTN3A3 encodes butyrophilin, subfamily 3, 
member A3, and has LocusID: 10384. BTN3A1 encodes butyrophilin, subfamily 3, 
member Al, and has LocusID: 1 1 1 19. Both genes are located at chromosome 6p22.1. 
[0207J CPS 123 corresponds to KIAA0776 which encodes KIAA0776 protein. This 

gene has LocusID: 23376, and is located on chromosome 6 with reported cytogenetic 
location 6ql6.3. The gene resides in genomic locus NTJH9424 (NCBI Genome 
Annotation). 

[0208] CPS 124 corresponds to D6S2245E (HSD17B8) which encodes 

hydroxysteroid (17-beta) dehydrogenase 8. This gene has LocusID: 7923, and is located on 
chromosome 6 with reported cytogenetic location 6p21.3. The gene resides in genomic 
locus NTJJ07592 (NCBI Genome Annotation). The protein encoded by this gene is similar 
to mouse Ke6 and is a member of the short-chain dehydrogenase superfamily. An 
alternatively spliced transcript of this gene has been detected. 

[0209] CPS 125 corresponds to KIAA0628. This gene has LocusID: 9831, and is 

located on chromosome 8 with reported cytogenetic location 8q24.3. The gene resides in 
genomic locus NTJJ23684 (NCBI Genome Annotation). CPS 125 is in the 3' UTR of the 
gene. 

[0210] CPS 126 corresponds to SELL which encodes selectin L (lymphocyte 

adhesion molecule 1). This gene has LocusID: 6402, and is located on chromosome 1 with 
reported cytogenetic location Iq23-q25. The gene resides in genomic locus NTJ)34405 
(NCBI Genome Annotation). Selectin L is a cell surface component that is a member of a 
family of adhesion/homing receptors which are involved in leukocyte-endothelial cell 
interactions. Selectin L is composed of multiple domains including one domain 
homologous to lectins, one to epidermal growth factor, and two to the consensus repeat 
units found in C3/C4 binding proteins. The protein can attach lymphocytes to lymph node 
high endothelial venules. 

[0211] CPS 127 corresponds to MPV17 which encodes MpV17 transgene, murine 

homolog, glomerulosclerosis. This gene has LocusID: 4358, and is located on chromosome 
2 with reported cytogenetic location 2p23-p21. The gene resides in genomic locus 
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NT_005204 (NCBI Genome Annotation). The gene product is similar to murine Mpvl 7. It 
is a predicted membrane protein and may be associated with nephrotic syndrome. 
[0212] CPS 128 corresponds to FABP5 which encodes fatty acid binding protein 5 

(psoriasis-associated). This gene has LocusID: 2171, and is located on chromosome 8 with 
reported cytogenetic location 8q2L13. The gene resides in genomic locus NT_007972 
(NCBI Genome Annotation). The gene product can be found in epidermal cells, and was 
identified as being upregulated in psoriasis tissue. Fatty acid binding proteins (FABPs) are 
a family of small, conserved, cytoplasmic proteins that bind long-chain fatty acids and other 
hydrophobic ligands. It is thought that FABPs roles include fatty acid uptake, transport, and 
metabolism. 

[0213] Nucleotides 1 to 260 of CPS 128 have 100% sequence identity with an intron 

sequence of STX3A. STX3A encodes syntaxin 3A, and has LocusID: 6809 with reported 
cytogenetic location llql2.3. STX3A resides in genomic locus NT_033903. The 
alignment between CPS 128 and STX3A is in the non-protein-coding of the gene. 
Nucleotides 1 to 260. of CPS 128, or fragments thereof, also align to various regions on 
chromosomes 7, 13 and 15 with 95-97% sequence identity. In addition, nucleotides 1-260 
of CPS 128, or fragments thereof, align to various region on chromosomes 2, 4, 13, 15 and 
22 with sequence identity 88-93%. 

[0214] CPS 129 corresponds to CASP1 which encodes caspase 1, apoptosis-related 

cysteine protease (interleukin 1, beta, convertase). This gene has LocusID: 834, and is 
located on chromosome 11 with reported cytogenetic location llq23. The gene resides in 
genomic locus NTJ)09151 (NCBI Genome Annotation). The gene product is a member of 
the cysteine-aspartic acid protease (caspase) family. Sequential activation of caspases plays 
a central role in the execution-phase of cell apoptosis. Caspases exist as inactive 
proenzymes which undergo proteolytic processing at conserved aspartic residues to produce 
2 subunits, large and small, that dimerize to form the active enzyme. The CASP1 gene 
product can proteolytically cleave and activate the inactive precursor of interlukin-1, a 
cytokine involved in the processes such as inflammation, septic shock, and wound healing. 
CASP1 gene has been shown to induce cell apoptosis and may function in various 
developmental stages. It may have a role in the pathogenesis of Huntington disease. 
Alternative splicing of this gene results in at least five transcript variants encoding distinct 
isoforms. 
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[0215] Nucleotides 1 to 487 of CPS 129, or fragments thereof, aligns with various 

chromosome 11 regions with sequence identity 86-95%. These regions include 
LOCI 20332 and LOC160131, both of which have reported cytogenetic location llq22.3 
and encode proteins similar to caspase 1, isoform beta precursor {interleukin 1-beta 
convertase) (interleukin l-B converting enzyme) (ILlB-convertase), 
[0216] CPS 1 30 corresponds to PTPN6 which encodes protein tyrosine phosphatase, 

non-receptor type 6. This gene has LocusID: 5777,^ and is located on chromosome 12 with 
reported cytogenetic location 12pl3. The gene resides in genomic locus NT_035206 (NCBI 
Genome Annotation). The protein encoded by this gene is a member of the protein tyrosine 
phosphatase (PTP) family. PTPs are known to be signaling molecules that regulate a 
variety of cellular processes including cell growth, differentiation, mitotic cycle, and 
oncogenic transformation. N-terminal part of PTPN6 gene product contains two tandem Src 
homolog (SH2) domains, which act as protein phospho-tyrosine binding domains and 
mediate the interaction of the gene product with its substrates. PTPN6 gene product can be 
expressed in hematopoietic cells, and may function as a regulator of multiple signaling 
pathways in hematopoietic cells. The gene product has been shown to interact with, and 
dephosphorylate a wide spectrum of phospho-proteins involved in hematopoietic cell 
signaling. At least three alternatively spliced variants of this gene, which encode distinct 
isoforms, have been reported. 

[0217] CPS 1 30 also aligns with a chromosomal region near REA with at least 98% 

sequence identity. REA encodes repressor of estrogen receptor activity. It has LocusID: 
1 1331 and reported cytogenetic location 12pl3. 

[0218] CPS 131 corresponds to SKAP-HOM (SCAP2) which encodes src family 

associated phosphoprotein 2. This gene has LocusID: 8935, and is located on chromosome 
7 with reported cytogenetic location 7p21-pl5. The gene resides in genomic locus 
NT_007819 (NCBI Genome Annotation). The protein encoded by this gene belongs to the 
src family kinases. The encoded protein is similar to the src family associated 
phosphoprotein 1 and may function as an adaptor protein. The encoded protein has coiled- 
coil and SH3 domains, and is thought to play a role in the src signaling pathway in various 
cells. In a mouse study the SCAP2 gene is implicated in the processes of myeloid 
differentiation and growth arrest. 

[0219] CPS 133 corresponds to GR02 which encodes GR02 oncogene. This gene 

has LocusID: 2920, and is located on chromosome 4 with reported cytogenetic location 
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4q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). GR02 
is similar to GROl. The gene product is a chemotactic agent for polymoiphonuclear 
leukocytes. 

[0220] CPS 133 also aligns with GROl with 85% sequence identity. GROl 

encodes GROl oncogene (melanoma growth stimulating activity, alpha), and has LocusH): 
2919 with reported cytogenetic location 4q21. The gene resides in genomic locus 
NT 006216. 

[0221] CPS 134 corresponds to RRAS which encodes related RAS viral (r-ras) 

oncogene homolog. This gene has LocusID: 6237, and is located on chromosome 19 with 
reported cytogenetic location 19ql3.3-qter. The gene resides in genomic locus NT_011109 
(NCBI Genome Annotation). . The gene product is a member of the ras family of GTP 
binding proteins. 

[0222] CPS 1 35 corresponds to KIAA0022 which encodes KIAA0022 gene product. 

This gene has LocusID: 9936, and is located on chromosome 2 with reported cytogenetic 
location 2q24.2. The gene resides in genomic locus NT_005151 <NCBI Genome 
Annotation). 

[0223] CPS 136 corresponds to FLII which encodes flightless I homolog 

(Drosophila). This gene has LocusID: 2314, and is located on chromosome 17 with 
reported cytogenetic location 17pll.2. the gene resides in genomic locus NT_030843 
(NCBI Genome Annotation). The gene product is a homolog of Drosophila flightless L It 
has a gelsolin-like actin-binding domain and a leucine-rich interaction domain. 
[0224] CPS 137 corresponds to PRF1 which encodes perforin 1 (pore forming 

protein). This gene has LocusID: 5551, and is located on chromosome 10 with reported 
cytogenetic location 10q22. The gene resides in genomic locus NT_024037 (NCBI 
Genome Annotation). Perforin is a cytolytic, channel-forming protein which plays a role in 
clearing virally infected host cells and tumor cells. 

[0225] CPS 138 corresponds to KIAA0102 which encodes KIAA0102 gene product 

This gene has LocusID: 9789, and is located on chromosome 11 with reported cytogenetic 
location llql3.3. The gene resides in genomic locus NT_033927 (NCBI Genome 
Annotation). 

[0226] CPS 138 also aligns with a chromosomal region near PTAFR with 99% 

sequence identity. PTAFR encodes platelet-activating factor receptor, and has LocusID: 
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5724 and reported cytogenetic location Ip35-p34.3. It resides in genomic locus 
NT_028053. Platelet-activating factor receptor is a G protein-coupled receptor. 



chromosomes 5, 10, 11 and 15. For instance, CPS 138 aligns with a chromosome 15 region 
with 94% sequence identity. The chromosome 15 region includes LOC255320 which 
encodes a protein similar to microsomal signal peptidase 25 kDa subunit (SPase 25 kDa 
subunit) (SPC25). , 



[0228] CPS 139 corresponds to UNK_AA176780 (HSA249128) which encodes 

DIPB protein. This gene has LocusID: 54765, and is located on chromosome 11 with 
reported cytogenetic location lip 11.2. The gene resides in genomic locus NT_009237 
(NCBI Genome Annotation). The gene product is member of the B-box zinc finger family, 
and contains a region of low similarity to a region of murine Mid2, CPS 139 aligns with the 
region 3' to the protein-coding strand of UNK_AA1 76780. 

[0229] CPS 140 corresponds to PYGL which encodes phosphorylase, glycogen; 

liver (Hers disease, glycogen storage disease type VI). This gene has LocusID: 5836, and is 
located on chromosome 14 with reported cytogenetic location 14q21-q22. The gene resides 
in genomic locus NTJ)25892 (NCBI Genome Annotation). 

[0230] CPS 141 corresponds to DBP which encodes D site of albumin promoter 

(albumin D-box) binding protein. This gene has LocusID: 1628, and is located on 
chromosome 19 with reported cytogenetic location 19ql3.3. The gene resides in genomic 
locus NT_011109 (NCBI Genome Annotation). Albumin D-site-binding protein is a 
transcription factor which may play a role in the diurnal regulation of liver-specific genes. 
It is a member of the PAR (proline and acidic amino acid-rich) b/ZIP family. 
[0231] CPS 143 corresponds to RANBP2L1 which encodes RAN binding protein 2- 

like 1. This gene has LocusID: 84220, and is located on chromosome 2 with reported 
cytogenetic location 2ql2.3. The gene resides in genomic locus NT_022135 (NCBI 
Genome Annotation). RAN is a small GTP-binding protein of the RAS superfamily that is 
associated with the nuclear membrane and is thought to control a variety of cellular 
functions through its interactions with other proteins. RANBP2L1 gene shares sequence 
similarity with RANBP2, a large RAN-binding protein localized at the cytoplasmic side of 
the nuclear pore complex. It is believed that this RANBP2 gene family member arose from 
a duplication event 3 Mb distal to RANBP2. Alternative splicing has been observed for this 
locus and two variants are described. Additional splicing is suggested. 



[0227J 



In addition, CPS 138 has 86-94% sequence identity with various regions in 
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[0232] CPS 143 also aligns with LOC220692 with 100% sequence identity. 

LOC220692 encodes protein similar to RAN-binding protein 2-like 1, isoform 1 (spenn 
membrane protein BS-63). The gene is located at chromosome 2ql2.3 and resides in 
genomic locus NT_0221 35 (NCBI Genome Annotation). 

[0233] In addition, CPS 143 has 94-96% sequence identity with KIAA0336, a 

chromosomal region near LOC256197, and an intron of LOC15082L KIAA0336 encodes 
KIAA0336 gene product, and has LocusID: 9648 and reported cytogenetic location 2ql2.2. 
LOC256197 encodes a protein similar to ribosomal protein L22, and is located on 
chromosome 2. LOCI 50821 encodes a protein similar to KIAA0336 gene product, and is 
located at chromosome 2pl 1.1. 

[0234] CPS 144 corresponds to TNFSF10 which encodes tumor necrosis factor 

(ligand) superfamily, member 10. This gene has LocusID: 8743, and is located on 
chromosome 3 with reported cytogenetic location 3q26. The gene resides in genomic locus 
NTJJ25667 (NCBI Genome Annotation). Tumor necrosis factor (TNF) family cytokines 
function as mediators.of immune regulation and the inflammatory response. TNFSF10 gene 
product is a Type II glycoprotein of the tumor necrosis factor ligand superfamily, and can 
mediate cell death. 

[0235] CPS 145 corresponds to KIAA0854 which encodes KIAA0854 protein. This 

gene has LocusID: 22882, and is located on chromosome 8 with reported cytogenetic 
location 8q24.13, The gene resides in genomic locus NTJ)23663 (NCBI Genome 
Annotation). 

[0236] CPS 146 corresponds to SDS which encodes serine dehydratase. This gene 

has LocusID: 10993, and is located on chromosome 12 with reported cytogenetic location 
12q24.12. The gene resides in genomic locus NT_009601 (NCBI Genome Annotation). 
Serine dehydratase catalyzes the PLP-dependent alpha,beta-elinunation of L-serine to 
pyruvate and ammonia. It is one of three enzymes that are regarded as metabolic exits of 
the serine-glycine pool. Serine dehydratase can be found in the liver. 
[0237] CPS 147 corresponds to KIAA0576 (COASTER) which encodes coactivator 

for steroid receptors. This gene has LocusID: 26036, and is located on chromosome 6 with 
reported cytogenetic location 6pll.l. The gene resides in genomic locus NT_007592 
(NCBI Genome Annotation). 
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[0238] Asymetrix annotation suggests that CPS 148 corresponds to FCGR3B which 

encodes Fc fragment of IgG, low affinity nib, receptor for (CD\6). The gene has LocusID: 
221 5, and is located at chromosome 1 q23. 

[0239] Blast search of the Entrez human genome sequence database shows that CPS 

148 aligns with FCGR3A with at least 97% sequence identity. FCGR3A encodes Fc 
fragment of IgG, low affinity Ilia, receptor for (CD16). The gene has LocusID: 2214, and 
is located on chromosome 1 with reported cytogenetic location lq23. The gene resides in 
genomic locus NTJ)04668 (NCBI Genome Annotalion). The gene product is a Type in Fc 
gamma receptor and a member of the immunoglobulin superfamily. It can associate with 
zeta chain of the T-cell receptor complex (CD3Z). 

[0240] CPS 149 corresponds to FCER1A which encodes Fc fragment of IgE, high 

affinity I, receptor for; alpha polypeptide. The gene product is the alpha subunit of the high 
affinity IgE receptor, and may be involved in triggering allergic responses. This gene has 
LocusID: 2205, and is located on chromosome 1 with reported cytogeijetic location lq23. 
The gene resides in genomic locus NT_004982 (NCBI Genome Annotation). The IgE 
receptor contains 3 subunits: alpha, beta (MIM 147138 ), and gamma (MIM 147139 ). The 
alpha subunit can be glycosylated 

[0241] CPS 150 corresponds to CD44 which encodes CD44 antigen (homing 

function and Indian blood group system). This gene has LocusID: 960, and is located on 
chromosome 11 with reported cytogenetic location llpl3. The gene resides in genomic 
locus NT_009237 (NCBI Genome Annotation). 

[0242] CPS 151 corresponds to ID1 which encodes inhibitor of DNA binding 1, 

dominant negative helix-loop-helix protein. This gene has LocusID: 3397, and is located on 
chromosome 20 with reported cytogenetic location 20qlL The gene resides in genomic 
locus NT_028392 (NCBI Genome Annotation). The gene product is a member of the Id 
helix-loop-helix family of proteins, and may negatively regulate cell differentiation. 
[0243] Blast search of the Entrez human genome sequence database shows that CPS 

152 overlaps KIAA0963. CPS 152 aligns with the non-protein-coding strand of 
KIAA0963. KIAA0963 encodes KIAA0963 protein, and has LocusID: 22904 with reported 
cytogenetic location 19pl3.3. KIAA0963 resides in genomic locus NTJ)11277 (NCBI 
Genome Annotation). 

[0244] CPS 153 corresponds to ADTB2 (AP2B1) which encodes adaptor-related 

protein complex 2, beta 1 subunit. This gene has LocusID: 163, and is located on 
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chromosome 17 with reported cytogenetic location 17qll.2-ql2. The gene resides in 
genomic locus NTJH0799 (NCBI Genome Annotation). The beta adaptin subunit is part of 
the clathrin coat assembly complex which links clathrin to receptors in coated pits and 
vesicles. These vesicles are involved in endocytosis and Golgi processing. The beta 1 
subunit is one of the assembly proteins which binds to clathrin and initiates coat formation. 
[0245] CPS 154 corresponds to MADH3 which encodes MAD, mothers against 

decapentaplegic homolog 3 (Drosophila). This gene has LocusID: 4088, and is located on 
chromosome 15 with reported cytogenetic location 15q21-q22. The gene resides in 
genomic locus NT_010265 (NCBI Genome Annotation). The gene product is similar to 
murine Madh3. It is a member of the Smad family of proteins, and may affect transcription 
in response to TGF-beta signaling pathways. 

[0246] CPS 155 corresponds to ARPC1B which encodes actin related protein 2/3 

complex, subunit IB (41 kD). This gene has LocusID: 10095* and is located on 
chromosome 7 with reported cytogenetic location 7qll.21. The gene resides in genomic 
locus NT_007933 (NCBI Genome Annotation). The gene product is involved in assembly 
of the actin cytoskeleton, and may have a role in protrusion of lamellipodia, 
[0247] CPS 155 also aligns with two chromosome 12 regions with 99-100% 

sequence identity. These two regions reside in genomic locus NTJB5283 (NCBI Genome 
Annotation). The first region is located near LOCI 96489 which encodes a protein similar 
to FLJ00209 protein. The second region is located near LOC144584 which encodes a 
protein similar to vacuolar protein sorting 35 (yeast); maternal-embryonic 3. Both 
LOC196489 and LOC144584 have reported cytogenetic location 12ql3.12. 
[0248] Affymetrix annotation suggests that CPS 156 corresponds to TALDOl 

which encodes transaldolase 1 and has LocusID: 6888. The gene is located at chromosome 
Ilpl5.5-pl5.4. 

[0249] CPS 157 corresponds to UNK_AL035079 (CAT) which encodes catalase. 

This gene has LocusID: 847, and is located on chromosome 11 with reported cytogenetic 
location llpl3. The gene resides in genomic locus NTJ)09237 (NCBI Genome 
Annotation). Catalase is a tetrameric hemoprotein that can detoxify hydrogen peroxide. 
[0250] CPS 158 corresponds to STXBP1 which encodes syntaxin binding protein 1. 

This gene has LocusID: 6812, and is located on chromosome 9 with reported cytogenetic 
location 9q34.1. The gene resides in genomic locus NTJ)29366 (NCBI Genome 
Annotation). 
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[0251] CPS 159 corresponds to UNG2 which encodes uracil-DNA glycosylase 2. 

This gene has LocusID: 10309, and is located on chromosome 5 with reported cytogenetic 
location 5pl5.2-pl3.1. The gene resides in genomic locus NT_006431 (NCBI Genome 
Annotation). Uracil-DNA glycosylase 2 possesses uracil-DNA glycosylase activity. 
[0252] CPS 160 corresponds to DUSP5 which encodes dual specificity phosphatase 

5. This gene has LocusID: 1847, and is located on chromosome 10 with reported 
cytogenetic location 10q25. The gene resides in genomic locus NT_030059 (NCBI 
Genome Annotation). The protein encoded by DUSP5 gene is a member of the dual 
specificity protein phosphatase subfamily. These phosphatases can inactivate their target 
kinases by dephosphorylating both the phosphoserine/threonine and phosphotyrosine 
residues. They can negatively regulate members of the mitogen-activated protein (MAP) 
kinase superfamily (MAPK/ERK, SAPK/JNK, p38), which are associated with cellular 
proliferation and differentiation. Different members of the family of dual specificity 
phosphatases show distinct substrate specificities for various MAP kinases, different tissue 
distribution and subcellular localization, and different modes of inducibility of their 
expression by extracellular stimuli. DUSP5 gene product can inactivate ERK1, and is 
expressed in a variety of tissues including pancreas and brain. 

[0253] CPS 161 corresponds to PTPRO which encodes protein tyrosine 

phosphatase, receptor type, O. This gene has LocusID: 5800, and is located on 
chromosome 12 with reported cytogenetic location 12pl3.3-pl3.2. The gene resides in 
genomic locus NTJ)09714 (NCBI Genome Annotation). The gene product is an integral 
membrane protein containing a transmembrane domain and an intracellular catalytic domain 
with a characteristic signature motif. Several alternatively spliced transcript variants, some 
of which encode different isoforms of the protein, have been described. The gene product 
also contains fibronectin type Hi-like repeats and putative glycosylation sites in the 
extracellular domain. 

[0254] Aflymetrix annotation suggests that CPS 162 corresponds to SLA which 

encodes Src-like-adaptor and has LocusID: 6503. The gene has reported cytogenetic 
location at chromosome 8q24. 

[0255] Blast search of the Entrez human genome sequence database shows that CPS 

162 aligns with an intron sequence of TG with at least 99% sequence identity. TG encodes 
thyroglobulin, and has LocusID: 7038 with reported cytogenetic location 8q24.2-q24.3. 
The gene resides in genomic locus NTJ)08150 (NCBI Genome Annotation). CPS 162 
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matches with the non-protein-coding strand of TG. Thyroglobulin is a precursor of thyroid 
hormones. 

[02561 CPS 163 corresponds to RFP2 which encodes ret finger protein 2. This gene 

has LocusID: 10206, and is located on chromosome 13 with reported cytogenetic location 
13ql4. The gene resides in genomic locus NT_033922 (NCBI Genome Annotation). The 
protein encoded by RFP2 gene is a member of the tripartite motif (TRIM) family. The 
TRIM motif includes three zinc-binding domains, a RING, a B-box type 1 and a B-box type 
2, and a coiled-coil region. The RFP2 gene product localizes to cytoplasmic bodies near the 
nucleus. The gene is located on chromosome 13 within the minimal deletion region for B- 
cell chronic lymphocytic leukemia. Alternative splicing of this gene has been described. 
[0257] Nucleotides 3 to 255 of CPS 163 have 83% sequence identity with a 

chromosomal region near LOCI 62 162. LOCI 62 162 encodes a protein similar to 
spinocerebellar ataxia type 1, and has reported cytogenetic location 16q23.1. 
[0258] CPS 1 64 corresponds to ADFP which encodes adipose differentiation-related 

protein (adipophilin).. This gene has LocusID: 123, and is located on chromosome 9 with 
reported cytogenetic location 9p21.2, The gene resides in genomic locus NTJ)23974 
(NCBI Genome Annotation). Adipocyte differentiation-related protein is associated with 
the globule surface membrane material. The protein is a major constituent of the globule 
surface. Increase in mRNA levels is one of the earliest indications of adipocyte 
differentiation. The product is also a component of milk lipid globules, 
[0259] CPS 164 aligns with a chromosome 1 region near LOC254424 with 94% 

sequence identity. LOC254424 encodes a protein similar to adipophilin (adipose 
differentiation-related protein) (ADRP). 

[0260] CPS 165 corresponds to ADTD (AP3D1) which encodes adaptor-related 

protein complex 3, delta 1 subunit This gene has LocusID: 8943, and is located on 
chromosome 19 with reported cytogenetic location 19pl3.3. The gene resides in genomic 
locus NT_011268 (NCBI Genome Annotation). Delta-adaptin is a component of the AP-3 
complex which is involved in intracellular transport. 

[0261] Asymetrix annotation suggests that CPS 166 corresponds to ADTG which is 

also known as AP1G1. The gene encodes adaptor-related protein complex 1, gamma 1 
subunit. The gene has LocusID: 164, and is located at chromosome 16q23. 
[0262] Blast search of the Entrez human genome sequence database shows that CPS 

166 aligns to LOC255980 with at least 99% sequence identity. LOC255980 encodes a 
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protein similar to hypothetical protein FLJ20151, and is located on chromosome 15. 
LOC255980 resides in genomic locus NT_010265. It overlaps FLJ20151 which has 
LocusID: 54837 and reported cytogenetic location 15q21.3. 

[0263] CPS 168 corresponds to UNK_X87344 (PSMB8) which encodes proteasome 

(prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7). This gene 
has LocusID: 5696, and is located on chromosome 6 with reported cytogenetic location 
6p21.3. The gene resides in genomic locus NT_007 ( 592 (NCBI Genome Annotation). Beta 
subunit 8 of the proteasome (prosome macropain) 'can replace beta subunit PSMB5 when 
cells are stimulated by interferon gamma, thereby altering proteasome specificity. 
[0264] CPS 170 corresponds to CAPG which encodes capping protein (actin 

filament), gelsolin-like. This gene has LocusID: 822, and is located on chromosome 2 with 
reported cytogenetic location 2cen-q24, The gene resides in genomic locus NT_015805 
(NCBI Genome Annotation). The gene product is a macrophage capping protein. It can 
reversibly block the barbed ends. The gene product is a member of the gelsolin/villin 
protein family. 

[0265] CPS 171 corresponds to ARL7 which encodes ADP-ribosylation factor-like 

7. This gene has LocusID: 10123, and is located on chromosome 2 with reported 
cytogenetic location 2q37.2. The gene resides in genomic locus NT_005414 (NCBI 
Genome Annotation). ADP-ribosylation factor-like 7 (ARL7) is a member of the ADP- 
ribosylation factor family of GTP-binding proteins. ARL7 is similar to ARL4 and ARL6, 
and each has a nuclear localization signal and a high guanine nucleotide exchange rate. 
[0266] CPS 172 corresponds to RCV1 which encodes recoverin. This gene has 

LocusID: 5957, and is located on chromosome 17 with reported cytogenetic location 
17pl3.1. The gene resides in genomic locus NT_010718 (NCBI Genome Annotation). 
Recoverin is a calcium-binding protein which can activate guanylate cyclase activity. 
[0267] CPS 173 coiresponds to GR03 which encodes GR03 oncogene. This gene 

has LocusID: 2921, and is located on chromosome 4 with reported cytogenetic location 
4q21. The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). GR03 
is similar to human GROl, and may encode a mitogenic factor. 

[0268] CPS 175 coiresponds to UNKAF070606 (ATP2B1) which encodes 

ATPase, Ca-H- transporting, plasma membrane 1. This gene has LocusID: 490, and is 
located on chromosome 12 with reported cytogenetic location 12q21-q23. The gene resides 
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in genomic locus NT_009729 (NCBI Genome Annotation). CPS 175 aligns with the 3' 
UTR of the protein-coding strand of the gene. 

[0269] CPS 176 corresponds to AO AH which encodes acyloxyacyl hydrolase 

(neutrophil). This gene has LocusID: 313, and is located on chromosome 7 with reported, 
cytogenetic location 7pl4-pl2. The gene resides in genomic locus NT_007819 (NCBI 
Genome Annotation). Acyloxyacyl hydrolase is a 2-subunit lipase which can hydrolyze the 
secondary (acyloxyacyl-linked) fatty acyl chains from the lipid A region of bacterial 
endotoxins. Acyloxyacyl hydrolase may modulate host inflammatory responses to gram- 
negative bacterial invasion. The 2 subunits are encoded by a single mRNA. 
[0270] CPS 177 corresponds to UNKAJ224442 (WBSCR22) which encodes 

Williams Beuren syndrome chromosome region 22.; This gene has LocusID: 1 14049, and is 
located on chromosome 7. The gene resides in genomic locus NT_007758 (NCBI Genome 
Annotation). 

[0271] CPS 178 corresponds to MSR1 which encodes macrophage scavenger 

receptor 1. This gene has LocusID: 4481, and is located on chromosome 8 with reported 
cytogenetic location 8p22. The gene resides in genomic locus NTJ) 15280 (NCBI Genome 
Annotation). This gene encodes the class A macrophage scavenger receptors, which 
include at least three different types (1, 2, 3) generated by alternative splicing of this gene. 
These receptors or isoforms are macrophage-specific trimeric integral membrane 
glycoproteins and have been implicated in many macrophage-associated physiological and 
pathological processes including atherosclerosis, Alzheimer's disease, and host defense. 
The isoforms type 1 and type 2 are functional receptors and are able to mediate the 
endocytosis of modified low density lipoproteins (LDLs). The isoform type 3 may have an 
altered intracellular processing and can be trapped within the endoplasmic reticulum. The 
isoform type 3 can inhibit the function of isoforms type 1 and type 2 when co-expressed, 
indicating a dominant negative effect and suggesting a mechanism for regulation of 
scavenger receptor activity in macrophages. 

[0272] CPS 179 corresponds to XAP4 (C20orfl8) which encodes chromosome 20 

open reading frame 18. This gene has LocusID: 10616, and is located on chromosome 20 
with reported cytogenetic location 20pl3. The gene resides in genomic locus NT_011387 
(NCBI Genome Annotation). The gene contains at least 12 exons. Several alternatively 
spliced transcript variants have been described. The gene product is similar to mouse 



S7 



WO 2004/072265 



PCT/US2004/004118 



UIP28/UbcM4 interacting protein at the amino acid level, and contains a C3HC4 type 
(RING) zinc finger. It may interact with PKC and mediate protein-protein interactions. 
[0273J CPS 180 corresponds to C1QR1 which encodes complement component 1, q 

subcomponent, receptor 1. This gene has LocusID: 22918, and is located on chromosome 
20 with reported cytogenetic location 20pll.21. The gene resides in genomic locus 
NT_01 1387 (NCBI Genome Annotation). The gene product is a type I membrane protein 
and can act as a receptor for complement protein Clq, rnannose-binding lectin, and 
pulmonary surfactant protein A. The gene product is a functional receptor involved in 
ligand-mediated enhancement of phagocytosis. 

[0274] CPS 181 corresponds to STAT4 which encodes signal transducer and 

activator of transcription 4. This gene has LocusID: 6775, and is located on chromosome 2 
with reported cytogenetic location 2q32.2-q32.3. The gene resides in genomic locus 
NTJ)22197 (NCBI Genome Annotation). The gene product is a member of the STAT 
family of transcription factors. In response to cytokines and growth factors, STAT family 
members may be phosphorylated by the receptor associated kinases, and then form homo- 
or heterodimers that translocate to the cell nucleus where they act as transcription activators, 
STAT4 gene product may be involved in mediating responses to IL12 in lymphocytes and 
regulating the differentiation of T helper cells. 

[0275] CPS 182 corresponds to UNK_AL049309 (SFRS12) which encodes splicing 

factor, arginine/serine-rich 12. This gene has LocusID: 140890, and is located on 
chromosome 5 with reported cytogenetic location 5ql2.3. The gene resides in genomic 
locus NT_00643 1 (NCBI Genome Annotation). 

[0276] CPS 183 corresponds to RNASE2 which encodes ribonuclease, RNase A 

family, 2 (liver, eosinophil-derived neurotoxin). This gene has LocusID: 6036, and is 
located on chromosome 14 with reported cytogenetic location 14q24-q31. The gene resides 
in genomic locus NT_025892 (NCBI Genome Annotation). The gene product is a member 
of ribonuclease superfamily, and has neurotoxic and ribonuclease activities. 
[0277] CPS 183 also aligns with LOC122661 and RNASE3 with about 86-93% 

sequence identity. LOCI 22661 encodes a protein similar to nonsecretory ribonuclease 
precursor (Ribonuclease US) (Eosinophil-derived neurotoxin) (RNase UpI-2) (Ribonuclease 
2) (RNase 2). It is located at chromosome 14qll.l, and resides in genomic locus 
NT_025892. RNASE3 encodes ribonuclease, RNase A family, 3 (eosinophil cationic 
protein). RNASE3 has LocusID: 6037 and reported cytogenetic location 14q24-q31. It also 
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resides in genomic locus NT_025892. RNASE3 gene product is a member of the 
ribonuclease superfamily, and has both neurotoxic and ribonuclease activities. 
[0278] CPS 184 corresponds to NCF4 which encodes neutrophil cytosolic factor 4 

(40kD). This gene has LocusID: 4689, and is located on chromosome 22 with reported 
cytogenetic location 22ql3.1. The gene resides in genomic locus NT_011520 (NCBI 
Genome Annotation). The protein encoded by this gene is a cytosolic element of. 
nicotinamide adenine dinucleotide phosphate-oxidase. Upon neutrophil stimulation, the 
encoded protein and other cytosolic elements are sent to the cell membrane from the cytosol 
to form a complex which produces phagocytic oxygen radicals. Two motifs found in this 
protein, an SH3 domain and a PC motif, are significant to protein-protein interactions. 
Through interaction with the SH3 domain, NCF4 gene product is responsible for the 
downregulation of NADPH-oxidase. Alternative splicing has been observed in this gene. 
CPS 184 aligns to the 3' UTR of the gene. 

[0279] CPS 185 corresponds to ANXA4 which encodes annexin A4. This gene has 

LocusID: 307, and is located on chromosome 2 with reported cytogenetic location 2pl3. 
The gene resides in genomic locus NT_022184 (NCBI Genome Annotation). Annexin A4 
belongs to the annexin family of calcium-dependent phospholipid binding proteins. Several 
members of the annexin family have been implicated in membrane-related events along 
exocytotic and endocytotic pathways. Annexin A4 may interact with ATP, have in vitro 
anticoagulant activity, and inhibit phospholipase A2 activity. Annexin A4 can be detected 
in epithelial cells. 

[0280] CPS 186 corresponds to ME2 which encodes malic en2yme 2, NAD(+)- 

dependent, mitochondrial. This gene has LocusID: 4200, and is located on chromosome 18 
with reported cytogenetic location 18q21. The gene resides in genomic locus NTJ)33905 
(NCBI Genome Annotation). The gene product is a homotetrameric protein which can 
catalyze the oxidative decarboxylation of malate to pyruvate. 

[0281] CPS 183 also has 91% sequence identity with a chromosome 9 region near 

LOC169570. LOC169570 has reported cytogenetic location 9pl3.1. 
[0282] CPS 187 corresponds to IFI35 which encodes interferon-induced protein 35. 

This gene has LocusID: 3430, and is located on chromosome 17 with reported cytogenetic 
location 17q21. The gene resides in genomic locus NT_035490 (NCBI Genome 
Annotation). Interferon-induced protein 35 associates with B-ATF transcription factor on 
interferon treatment. It contains a leucine-zipper motif. 

89 



WO 2004/072265 



PCT/US2004/004118 



[0283] CPS 188 corresponds to VAMP5 which encodes vesicle-associated 

membrane protein 5 (myobrevin). This gene has LocusID: 10791, and is located on 
chromosome 2 with reported cytogenetic location 2pll.2. The gene resides in genomic 
locus NT_015805 (NCBI Genome Annotation). The gene product is a member of the 
synaptobrevin/VAMP family of proteins. 

[0284] CPS 188 also has 87% sequence identity with an intron sequence of NSF. 

NSF encodes N-ethylmaleimide-sensitive factor. It has LocusID: 4905 and reported 
cytogenetic location 17q21. 

[0285] CPS 189 corresponds to IMPA2 which encodes inositol(myo>l(or 4> 

monophosphatase 2. This gene has LocusID: 3613, and is located on chromosome 18 with 
reported cytogenetic location 18pll.2. The gene resides in genomic locus NT_010859 
(NCBI Genome Annotation). 

[0286] CPS 190 corresponds to GZMK which encodes granzyme K (serine protease, 

granzyme 3; tryptase II). This gene has LocusID: 3003, and is located on chromosome 5 
with reported cytogenetic location 5qll-ql2. The gene resides in genomic locus 
NT_006431 (NCBI Genome Annotation). This gene product is a member of a group of 
related serine proteases from the cytoplasmic granules of cytotoxic lymphocytes. Cytolytic 
T lymphocytes (CTL) and natural killer (NK) cells can recognize, bind, and lyse specific 
target cells. They are thought to protect theiFhost by lysing cells bearing on their surface 
"nonself antigens, usually peptides or proteins resulting from infection by intracellular 
pathogens.i 

[0287] CPS 191 corresponds to BGN which encodes biglycan. This gene has 

LocusID: 633, and is located on chromosome X with reported cytogenetic location Xq28. 
The gene resides in genomic locus NT_025965 (NCBI Genome Annotation). The protein 
encoded by this gene is a small cellular or pericellular matrix proteoglycan that is related in 
structure to two other small proteoglycans, decorin and fibromodulin. The encoded protein 
and decorin are thought to be the result of a gene duplication. Decorin contains one 
attached glycosaminoglycan chain, while biglycan probably contains two chains. Biglycan 
is thought to function in connective tissue metabolism by binding to collagen fibrils and 
transfering growth factor-beta. It may promote neuronal survival. This gene is a candidate 
gene for the Happle syndrome. 

[0288] CPS 192 corresponds to AIF1 which encodes allograft inflammatory factor 

1. This gene has LocusID: 199, and is located on chromosome 6 with reported cytogenetic 
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location 6p21.3. The gene resides in genomic locus NT_007592 {NCBI Genome 
Annotation). This gene is induced by cytokines and interferon. Its protein product is 
thought to be involved in negative regulation of growth of vascular smooth muscle cells, 
which contributes to the anti-inflammatory response to vessel wall trauma. The gene 
expresses at least three transcripts. 

10289] CPS 193 corresponds to CBP2 (SERPINH2) which encodes serine (or 

cysteine) proteinase inhibitor, clade H (heat shock protein 47), member 2. This gene has 
LocusID: 872, and is located on chromosome 11 with reported cytogenetic location 
1 lql3.5. The gene resides in genomic locus NTJJ33927 (NCBI Genome Annotation). The 
gene product is also known as Colligin-2, which is a collagen-binding protein that acts as a 
heat shock protein. 

[0290] CPS 193 also has 92% sequence identity with a chromosome 9 region near 

pshsp47. Pshsp47 encodes serine (or cysteine) proteinase inhibitor, clade H (heat shock 
protein 47), member 2 pseudogene. It has LocusID: 158172 and reported cytogenetic 
location 9pl 1.2. 

[0291] Aflymetrix annotation suggests that CPS 195 corresponds to IQGAP1 which 

encodes IQ motif containing GTPase activating protein 1. The gene has LocusID: 8826 
with reported cytogenetic location 15q26.L 

[0292] Blast search of the Entrez human genome sequence database shows that 

nucleotides 395 to 5984 of L33075 align with IQGAP1 with 98% sequence identity. 
IQGAP1 encodes IQ motif containing GTPase activating protein 1. The gene has LocusID: 
8826, and is located on chromosome 15 with reported cytogenetic location 15q26.1. The 
gene resides in genomic locus NT_033276. The gene product contains a GTPase activating 
domain and multiple calmodulin binding domains, and can bind to actin cytoskeleton and 
inhibit GTPase activity of ras family of GTP binding proteins Cdc42Hs and rac. 
[0293] CPS 196 corresponds to KIAA0823 (PPP1R16B) which encodes protein 

phosphatase 1, regulatory (inhibitor) subunit 16B. This gene has LocusID: 26051, and is 
located on chromosome 20 with reported cytogenetic location 20qll.23. The gene resides 
in genomic locus, NT_01 1362 (NCBI Genome Annotation). 

[0294] CPS 197 corresponds to UNKAI018098 (MGC15523) which encodes 

hypothetical protein MGC15523. This gene has LocusID: 124565, and is located on 
chromosome 17 with reported cytogenetic location 17q25.3. The gene resides in genomic 
locus NT_010661 (NCBI Genome Annotation). 
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[0295] Affymetrix annotation suggests that CPS 198 corresponds to CD163. 

CD163 encodes CD163 antigen, and has LocusID: 9332. The gene is located at 
chromosome 12pl3.3. 

[0296] CPS 199 corresponds to IDG which encodes hexokinase 3 (white cell). This 

gene has LocusID: 3101, and is located on chromosome 5 with reported cytogenetic 
location 5q35.2. The gene resides in genomic locus NT_023132 (NCBI Genome 
Annotation). Hexokinases phosphorylate glucose to produce glucose-6-phosphate, thus 
committing glucose to the glycolytic pathway. HK3 gene encodes hexokinase 3 which is 
similar to hexokinases 1 and 2. Hexokinase 3 can be inhibited by its product glucose-6- 
phosphate. 

[0297] CPS 200 corresponds to FOS which encodes v-fos FBJ murine osteosarcoma 

viral oncogene homolog. This gene has LocusID: 2353, and is located on chromosome 14 
with reported cytogenetic location 14q24.3. The gene resides in genomic locus NTJJ26437 
(NCBI Genome Annotation). The Fos gene family has 4 members: FOS, FOSB, FOSL1, 
and FOSL2. These genes encode leucine zipper proteins that can dimerize with proteins of 
the JTJN family, thereby forming the transcription factor complex AP-1. As such, the FOS 
proteins have been implicated as regulators of cell proliferation, differentiation, and 
transformation. In some cases, expression of the FOS gene has also been associated with 
"apoptotic "cell death. ~ FOS gene product may also be involved in alteration of DNA 
methylation. 

[0298] CPS 201 corresponds to DIFF48 (C6orf32) which encodes chromosome 6 

open reading frame 32. This gene has LocusID: 9750, and is located on chromosome 6 with 
reported cytogenetic location 6p22.3-p21.32. The gene resides in genomic locus 
NT_007592 (NCBI Genome Annotation). The protein encoded by DIFF48 can stimulate 
the formation of a non-mitotic multinucleate syncytium from proliferative cytotrophoblasts 
during trophoblast differentiation. An alternatively spliced transcript variant of this gene 
has been described. 

[0299] CPS 202 corresponds to UNKL23134 (GZMM) which encodes granzyme 

M (lymphocyte met-ase 1). This gene has LocusID: 3004, and is located on chromosome 
19 with reported cytogenetic location 19pl3.3. The gene resides in genomic locus 
NTJH1227 (NCBI Genome Annotation). Human natural killer (NK) cells and activated 
lymphocytes express and store a distinct subset of neutral serine proteases together with 
proteoglycans and other immune effector molecules in large cytoplasmic granules. These 
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serine proteases are collectively termed granzymes and include at least four distinct gene 
products: granzyme A, granzyme B, granzyme H, and Met-ase which is also known as 
granzyme M. 

[0300] CPS 203 corresponds to LM02 which encodes LIM domain only 2 

(rhombotin-like 1). This gene has LocusID: 4005, and is located on chromosome 11 with 
reported cytogenetic location 1 lpl3. The gene resides in genomic locus NTJ)09237 (NCBI 
Genome Annotation). LM02 encodes a cysteine-rich, two LIM-domain protein that is 
involved in yolk sac erythropoiesis. The LM02 protein has a role in hematopoietic 
development. The LM02 transcription start site is located approximately 25 kb 
downstream from the 1 lpl3 T-cell translocation cluster (1 lpl3 ttc), where a number T-cell 
acute lymphoblastic leukemia-specific translocations occur. The LM02 protein is a 
member of the rhombotin family. 

[0301] CPS 204 corresponds to PDI2 (PADI2) which encodes peptidyl arginine 

deiminase, type II. This gene has LocusID: 11240, and is located on chromosome 1 with 
reported cytogenetic location Ip35.2-p35.1. The gene resides in genomic locus NT_034376 
(NCBI Genome Annotation). The gene product is similar to rat skeletal muscle peptidyl 
, arginine deiminase, type H It may convert arginine residues within proteins to citrulline 
residues. 

~[0302] CPS 205 corresponds to UNK_AL031282 (FLJ13052) which encodes NAD 

kinase. This gene has LocusID: 65220, and is located on chromosome 1 with reported 
cytogenetic location Ip36.33-p36.21. The gene resides in genomic locus NT_004350 
(NCBI Genome Annotation). 

[0303] CPS 205 also aligns with a chromosomal region near MMP23B with at least 

98% sequence identity. MMP23B encodes matrix metalloproteinase 23B, and has LocusID: 
8510 and reported cytogenetic location lp36.3. The gene resides in genomic locus 
NT_004350. CPS 205 aligns with the non-protein-coding strand of MMP23B, 
[0304] CPS 206 corresponds to GABRG2 which encodes gamma-aminobutyric acid 

(GABA) A receptor, gamma 2. This gene has LocusID: 2566, and is located on 
chromosome 5 with reported cytogenetic location 5q31.1-q33.1. The gene resides in 
genomic locus NT_030698 (NCBI Genome Annotation). The gamma-aminobutyric acid 
(GABA) A receptor, gamma 2 is found as an inhibitory neurotransmitter receptor in the 
brain. 
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[0305] CPS 207 corresponds to KIAA0404 which encodes KIAA0404 protein. This 

gene has LocusID: 23130, and is located on chromosome 11 with reported cytogenetic 
location llql3.1. The gene resides in genomic locus NTJJ33241 (NCBI Genome 
Annotation). 

[0306] CPS 208 corresponds to UNKJJ12471 (THBS1) which encodes 

thrombospondin 1. This gene has LocusID: 7057, and is located on chromosome 15 with 
reported cytogenetic location 15ql5. The gene resides in genomic locus NT_030828 (NCBI 
Genome Annotation). Thrombospondin- 1 has a ro/e in blood clotting and in angiogenesis. 
It is a member of a family of adhesive molecules. 

[0307] Affymetrix annotation suggests that CPS 209 corresponds to IGHA1 which 

encodes immunoglobulin heavy constant alpha 1. The gene has LocusID: 3493 and 
reported cytogenetic location 14q32.33. 

[0308] CPS 210 corresponds to ABCB1 which encodes ATP-binding cassette, sub- 

family B (MDR/TAP), member 1; This gene has LocusID: 5243, and is located on 
chromosome 7 with reported cytogenetic location 7q21.1. The gene resides in genomic 
locus NT_007933 (NCBI Genome Annotation). The membrane-associated protein encoded 
by this gene is a member of the superfamily of ATP-binding cassette (ABC) transporters. 
ABC proteins transport various molecules across extra- and intra-cellular membranes. ABC 
genes are divided into seven distinct subfamilies (ABC1, MDR/TAP, MRP, ALD, OABP, 
GCN20, White). This protein encoded by ABCB1 gene is a member of the MDR/TAP 
subfamily. Members of the MDR/TAP subfamily are involved in multidrug resistance. The 
ABCB gene product is an ATP-dependent drug efflux pump for xenobiotic compounds with 
broad substrate specificity. It is responsible for decreased drug accumulation in multidrug- 
resistant cells and often mediates the development of resistance to anticancer drugs. It can 
also function as a transporter in the blood-brain barrier. The ABCB gene product is also 
known as P glycoprotein 1 . 

[0309] CPS 211 corresponds to NRIP1 which encodes nuclear receptor interacting 

protein 1. This gene has LocusID: 8204, and is located on chromosome 21 with reported 
cytogenetic location 21qll.2. The gene resides in genomic locus NTJH1512 (NCBI 
Genome Annotation). Nuclear receptor interacting protein 1 is a nuclear protein that can 
interact with the hormone-dependent activation domain AF2 of nuclear receptors. Also 
known as RIP140, this protein modulates transcriptional activity of the estrogen receptor. 
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[0310] CPS 213 corresponds to GLB1 which encodes galactosidase, beta 1. This 

gene has LocusID: 2720, and is located on chromosome 3 with reported cytogenetic 
location 3p21.33. The gene resides in genomic locus NT_005580 (NCBI Genome 
Annotation). The gene product can catalyze cleavage of the terminal galactose. 
[0311] CPS 214 corresponds to MSC which encodes musculin (activated B-cell 

factor-1). This gene has LocusID: 9242, and is located on chromosome 8 with reported 
cytogenetic location 8q21. The gene resides in genomic locus NT_034895 (NCBI Genome 
Annotation), The gene product contains a bHLH moti$ a putative nuclear localization 
signal, a glycine-rich region, and a stretch of acidic residues. The gene product is capable 
of binding an E-box element either as a homodimer or as a heterodimer with E2A in vitro, 
and forms heterodimers with.E2A proteins in vivo. It also contains a transcriptional 
repression domain and is capable of inhibiting the transactivation capability of E47, an E2A 
protein, in mammalian cells. MSC is thought to be a downstream target of the B-cell 
receptor signal transduction pathway. 

[0312] CPS 216 corresponds to UNKU95006 (MGC14480) which encodes 

hypothetical protein MGC14480. This gene has LocusID: 201254, and is located on 
chromosome 17 with reported cytogenetic location 17q25.3. The gene resides in genomic 
locus NT_035479 (NCBI Genome Annotation). 

10313] ^Nucleotides 400 to 561 of CPS 216 also have 84% sequence identity with an 
intron sequence of EG1. EG1 encodes endothelial-derived gene 1, and has LocusID: 80306 
and reported cytogenetic location 4p J 6. 

[0314] CPS 217 corresponds to NK4 which encodes natural killer cell transcript 4. 

This gene has LocusID: 9235, and is located on chromosome 16 with reported cytogenetic 
location 16pl3.3. The gene resides in genomic locus NT_010552 (NCBI Genome 
Annotation). The gene product may play a role in cell adhesion. It contains an RGD motif. 
[0315] CPS 217 also has 96% sequence identity with LOC124213 which encodes a 

protein similar to natural killer cell transcript 4. LOC124213 is located at chromosome 
16pl3.1 1, and also resides in genomic locus NTJ)10552, 

[0316] CPS 218 corresponds to UNK_Z24724 (FLJ20986) which encodes 

hypothetical protein FLJ20986. This gene has LocusID: 79572, and is located on 
chromosome 3 with reported cytogenetic location 3q29. The gene resides in genomic locus 
NTJ)05535 (NCBI Genome Annotation). The alignment between CPS 218 and 
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UNK_Z24724 is within an intron of GP5 which encodes glycoprotein V. {platelet) and has 
LocusID: 2814. 

[0317] CPS 219 corresponds to GRN which encodes granulin. This gene has 

LocusID: 2896, and is located on chromosome 17 with reported cytogenetic location 
17q21.32. The gene resides in genomic locus NT_0 10755 (NCBI Genome Annotation). 
Granulin is a putative growth factor. It is cysteine rich and contains multiple granulin 
repeats. i 

[0318] CPS 220 corresponds to EREG which encodes epiregulin. This gene has 

LocusID: 2069, and is located on chromosome 4 with reported cytogenetic location 4ql3.3. 
The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). Epiregulin is 
a member of the epidermal growth factor family. Epiregulin can function as a ligand of 
EGFR (epidermal growth factor receptor), as well as a ligand of most members of the 
ERBB (v-erb-b2 oncogene homolog) family of tyrosine-kinase receptors. Epiregulin may 
promote cell proliferation. 

[0319] CPS 221 corresponds to KIAA0382 (ARHGEF12) which encodes Rho 

guanine nucleotide exchange factor (GEF) 12. This gene has LocusID: 23365, and is 
located on chromosome 1 1 with reported cytogenetic location llq23.3. The gene resides in 
genomic locus NT_033899 (NCBI Genome Annotation). Rho GTPases play a role in 
numerous cellular processes that are initiated by extracellular stimuli that work through G 
protein coupled receptors. Rho guanine nucleotide exchange factor (GEF) 12 may form a 
complex with G proteins and stimulate Rho-dependent signals. This protein is observed to 
form myeloid/lymphoid fusion partner in acute myeloid leukemia. 

[0320] CPS 222 corresponds to RNASE6 which encodes ribonuclease, RNase A 

family, k6. This gene has LocusID: 6039, and is located on chromosome 14 with reported 
cytogenetic location 14qll.L The gene resides in genomic locus NT_025892 (NCBI 
Genome Annotation). Ribonuclease k6 may function in host defense. It is a member of 
eosinophil-derived neurotoxin ribonuclease A superfamily. 

[0321] CPS 223 corresponds to KIAA0442 which encodes autism-related protein 1 . 

This gene has LocusID: 26053, and is located on chromosome 7 with reported cytogenetic 
location 7pl3. The gene resides in genomic locus NT_007758 (NCBI Genome 
Annotation). 

[0322] Blast search of the Entrez human genome sequence database shows that 

nucleotides 35 to 1907of CPS 225 align with various chromosomal regions with 89-90% 
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sequence identity. These regions include LOC255718, LOC140070, and a chromosome 2 
region between GPR39 and LOC255521. LOC255718 is located on chromosome 2 and has 
genomic locus NT_034486 (NCBI Genome Annotation). LOC140070 encodes a protein 
similar to Protein CDC27Hs (Cell division cycle protein 27 homolog) (H-NUC). It is 
located on chromosome Yqll.l and has genomic locus NT 011878. GPR39 and 
LOC255521 reside in genomic locus NT_034487. GPR39 encodes G protein-coupled 
receptor 39, and has LocusID: 2863 and reported cytogenetic location 2q21-q22. 
[0323] Nucleotides 1317 to 1481 of CPS 225 have 98% sequence identity with a 

chromosome 12 region near LOC256746. LOC256746 encodes phosphodiesterase 3 A, 
cGMP-inhibited. Nucleotides 1381 to 1418 and 1461 to 1502 have 100% sequence identity 
with an intron sequence of LOC126500. LOCI 26500 encodes a protein similar to 
hypothetical zinc finger protein KIAA1473. It is located on chromosome 19pl3.11, and 
resides in genomic locus NT 033317. Other fiagments of CPS 225 have 84-93% sequence 
identity with regions on chromosomes 7, 11, 14, 18 and 21. 

[0324] Blast search of the Entrez human genome sequence database shows that CPS 

226 aligns to a region located in an intron of ATP8A. CPS 226 aligns with the non-protein- 
coding strand of ATP8A2. ATP8A2 encodes ATPase, aminophospholipid transporter-like, 
Class I, type 8 A, member 2. ATP8A2 gene has LocusID: 51761, and is located on 
_ chromos6niri3 with "reported cytogenetic location 13ql2-13. ATP8A2 resides in genomic 
locus NT_009799 (NCBI Genome Annotation). ATP8A2 gene product may be a tumor 
suppressor. Loss of its function can convert cells to tumorigenic phenotype. 
[0325] CPS 227 corresponds to ARHC which encodes ras homolog gene family, 

member C. This gene has LocusID: 389, and is located on chromosome 1 with reported 
cytogenetic location Ip21-pl3. The gene resides in genomic locus NT_019273 (NCBI 
Genome Annotation). The gene product can regulate reorganization of the actin 
cytoskeleton. CPS 227 aligns with the 3' UTR region of ARHC. The. alignment is also 3' 
to MOV10 which encodes MovlO, Moloney leukemia virus 10, homolog (mouse). MOV10 
has LocusID: 4343 and reported cytogenetic location lpl2. It resides in genomic locus 
NT_019273. 

[0326] CPS 228 corresponds to CREM which encodes cAMP responsive element 

modulator. This gene has LocusID: 1390, and is located on chromosome 10 with reported 
cytogenetic location 10pl2.1-pll.l. The gene resides in genomic locus NT_033896-(NCBI 
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Genome Annotation). Cyclic AMP-responsive element modulator is a regulator of the 
transcription of cAMP-inducible genes. 

[0327] CPS 229 corresponds to ISG15 which encodes interferon-stimulated protein, 

15 kDa. This gene has LocusID: 9636, and is located on chromosome 1 with reported 
cytogenetic location lp36.33. The gene resides in genomic locus NT_004350 (NCBI 
Genome Annotation). The expression of ISG15 can be induced by interferon, 
[03281 CPS 230 corresponds to CCR7 which encodes chemokine (C-C motif) 

receptor 7. This gene has LocusID: 1236, and is located on chromosome 17 with reported 
cytogenetic location 17ql2-q21.2 . The gene resides in genomic locus NT_024901 (NCBI 
Genome Annotation). The gene product is a G protein-coupled receptor, and can bind to 
CC chemokine ELC and mediate intracellular calcium flux. 

[0329] CPS 231 corresponds to Fll which encodes coagulation factor XI (plasma 

thromboplastin antecedent). This gene has LocusID: 2160, and is located on chromosome 4 
with reported cytogenetic location 4q35. The gene resides in genomic locus NT_022792 
(NCBI Genome Annotation). Alternative splicing of the geiie results in at least two 
transcripts. One transcript encodes the circulating plasma factor XI and an alternate 
transcript lacking exon 5 encodes the platelet factor XI. The plasma factor XI is present in 
plasma as a zymogen. The platelet factor XI is localized to platelets and megakaryocytes, 
and may play a role both in the maintenance of normal hemostasis and as a substitute for 
plasma factor XL 

[0330] Blast search of the Entrez human genome sequence database shows that CPS 

232 aligns with a chromosome 4 region near LOCI 66760. This chromosome 14 region 
resides in genomic locus NT_006344 (NCBI Genome Annotation). LOCI 66760 has 
reported cytogenetic location 4pl 5.32. 

[0331] CPS 233 corresponds to UNK_AC002073 (MGC17330) which encodes 

hypothetical protein MGC17330. This gene has LocusID: 113791, and is located on 
chromosome 22 with reported cytogenetic location 22qll.2-q22. The gene resides in 
genomic locus NT_01 1 520 (NCBI Genome Annotation). 

[0332] CPS 234 corresponds to STK17A which encodes serine/threonine kinase 17a 

(apoptosis-inducing). This gene has LocusID: 9263, and is located on chromosome 7 with 
reported cytogenetic location 7pl2-pl4. The gene resides in genomic locus NT_007819 
(NCBI Genome Annotation). The alignment of CPS 234 and STK17A resides in an intron 
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of FLJ10803. FLJ10803 encodes hypothetical protein FLJ10803, and has LocusID: 55744 
and reported cytogenetic location 7pl 5 . 1 . 

[0333] CPS 235 corresponds to BLVRA which encodes biliverdin reductase A. 

This gene has LocusID: 644, and is located on chromosome 7 with reported cytogenetic 
location 7pl4-cen. The gene resides in genomic locus NT_007819 (NCBI Genome 
Annotation). 

[0334] CPS 236 corresponds to SPP1 which encodes secreted phosphoprotein 1 

(osteopontin, bone sialoprotein I, early T-lymphocyte activation 1). This gene has LocusID: 
6696, and is located on chromosome 4 with reported cytogenetic location 4q21-q25. The 
gene resides in genomic locus NT_006204 (NCBI Genome Annotation). The gene product 
is a bone and blood vessel extracellular matrix protein involved in calcification and 
atherosclerosis. 

[0335] CPS 237 corresponds to STX1A which encodes syntaxin 1A (brain). This 

gene has LocusID: 6804, and is located on chromosome 7 with reported cytogenetic 
location 7qll.23. The gene resides in genomic locus NT_007758 (NCBI Genome 
Annotation). Syntaxin 1A (brain) may be involved in intracellular transport and 
neurotransmitter release. 

[0336] CPS 238 corresponds to TKTL1 which encodes transketolase-like 1. This 

"gene h^~LocusIDT"8277r^id~is~lbcated _ on chromosome X with reported cytogenetic 
location Xq28. The gene resides in genomic locus NT__025965 (NCBI Genome 
Annotation). Transketolase-like 1 is a thiamine pyrophosphate-dependent enzyme of 
pentose phosphate pathway. 

[0337] CPS 239 corresponds to IER3 which encodes immediate early response 3. 

This gene has LocusID: 8870, and is located on chromosome 6 with reported cytogenetic 
location 6p21.3. The gene resides in genomic locus NT_007592 (NCBI Genome 
Annotation). This gene functions in the protection of cells from Fas or tumor necrosis 
factor type alpha-induced apoptosis. Alternative splicing of this gene results in at least two 
transcript variants. 

[0338] CPS 241 corresponds to PLAU which encodes plasminogen activator, 

urokinase. This gene has LocusID: 5328, and is located on chromosome 10 with reported 
cytogenetic location 10q24. The gene resides in genomic locus NT_033890 (NCBI 
Genome Annotation). Urokinase plasminogen activator is a serine protease that cleaves 
plasminogen to form plasmin. 
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[0339] CPS 242 corresponds to ASS which encodes argininosuccinate synthetase. 

This gene has LocusID: 445, and is located on chromosome 9 with reported cytogenetic 
location 9q34.1. The gene resides in genomic locus NT_008338 (NCBI Genome 
Annotation). The protein encoded by this gene catalyzes the penultimate step of the 
arginine biosynthetic pathway. There are approximately 10 to 14 copies of this gene 
including the pseudogenes scattered across the human genome, among which the one 
located on chromosome 9 appears to be the only, functional gene for argininosuccinate 
synthetase. Mutations in the chromosome 9 copy of ASS cause citrullinemia. At least two 
alternatively spliced transcript variants of this gene have been reported. 
[0340] CPS 242 also has 87-94% sequence identity with various regions on 

chromosome 2, 4, 5, 6, 7, 9, 11, 12 and X.. These regions include LOC167449, 
. - -- - - LOC222906, ASSP2, ASSP3, ASSP4, ASSP5, LOC158452, LOC253843, LOC120341, 
LOC167519, and an intron of intron of LOC90957. LOCI 67449 encodes a protein similar 
to argininosuccinate synthetase and resides in genomic locus NT_006431 with reported 
cytogenetic location 5qll.2. LOC222906 encodes a protein similar to argininosuccinate 
synthetase, and resides in genomic locus NT_007819 with reported cytogenetic location 
7p21.3. ASSP2 (LocusID: 447) encodes argininosuccinate synthetase pseudogerie 2, and 
resides in genomic locus NTJ)07592 with reported cytogenetic location 6p22,l, ASSP3 
(LocusID: 448) "encodes argininosuccinate synthetase pseudogene 3, and resides in genomic 
locus NTJ)33256 with reported cytogenetic location 9qll-q22. ASSP4 (LocusID: 449) 
encodes argininosuccinate synthetase pseudogene 4, and resides in genomic locus 
' NTJ)25302 with reported cytogenetic location Xpter-p22. ASSP5 (LocusID: 450) encodes 
argininosuccinate synthetase pseudogene 5, and resides in genomic locus NT_028405 with 
reported cytogenetic location Xq22-q26. LOCI 58452 encodes a protein similar to 
argininosuccinate synthetase, and resides in genomic locus NT_023935 with reported 
cytogenetic location 9q21.31. LOC253843 encodes a protein similar to argininosuccinate 
synthetase, and resides in genomic locus NTJ)06258 on chromosome 4. LOCI 20341 
encodes a protein similar to argininosuccinate synthetase, and resides in genomic locus 
NTJ)09151 with reported cytogenetic location Uq23.1. LOC167519 encodes a protein 
similar to argininosuccinate synthetase, and resides in genomic locus NT_034778 with 
reported cytogenetic location 5q31.3. LOC90957 (LocusID: 90957) encodes DEAH-box 
RNA/DNA helicase AAM73547, and resides in genomic locus NT_005367 with reported 
cytogenetic location 2p22.3. 
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[0341] Affymetrix annotation suggests that CPS 244 corresponds to DUX1 which 

encodes double homeobox, 1. The gene has LocusID: 26584. 

[0342] Blast search of the Entrez human genome sequence database shows that 

nucleotides 1 to 689 of AJ001481 (SEQ ID NO: 227) align with two chromosome 4 regions 
with 85-88% sequence identity. The first region is located 3' to LOC131308. LOC131308 
encodes a protein similar to FSHD Region Gene 2 protein. It has reported cytogenetic 
location 3pl4.1 and resides in genomic locus NT_022665. The second region is located 
near TRAP95. TRAP95 encodes thyroid hormone receptor-associated protein, 95-kD 
subunit. TRAP95 has LocusID: 10025 and reported cytogenetic location 19pl3.3. It 
resides in genomic locus NTJU1277. TRAP95 gene product is a subunit of TRAP thyroid 
hormone receptor-associated protein complex, and functions as a coactivator for nuclear 
receptors. 

[0343] CPS 245 corresponds to GPA33 which encodes glycoprotein A33 

(transmembrane). This gene has LocusID: 10223, and is located on chromosome 1 with 
reported cytogenetic location lq23.2. The gene resides in genomic locus NT_004668 
(NCBI Genome Annotation). The glycoprotein encoded by this gene is a cell surface 
antigen that is expressed in human colon cancers. The sequence of the extracellular region 
of the encoded protein contains 2 domains characteristic of the CD2 subgroup of the 
immunoglobulin (Ig) superfamily. The encoded protein may play a role in cell adhesion. 
[0344] CPS 246 corresponds to PAI1 (SERPINE1) which encodes serine (or 

cysteine) proteinase inhibitor, clade E (nexin, plasminogen activator inhibitor type 1), 
member 1. This gene has LocusID: 5054, and is located on chromosome 7 with reported 
cytogenetic location 7q21.3-q22. The gene resides in genomic locus NT_007933 (NCBI 
Genome Annotation). The gene product may regulate fibrinolysis. It is a member of the 
serpin family of serine protease inhibitors. 

[0345] CPS 247 corresponds to CYP1B1 which encodes cytochrome P450, 

subfamily I (dioxin-inducible), polypeptide 1 (glaucoma 3, primary infantile). This gene 
has LocusID: 1545, and is located on chromosome 2 with reported cytogenetic location 
2p21. The gene resides in genomic locus NT_005367 (NCBI Genome Annotation). This 
gene product is a member of the cytochrome P450 superfamily of enzymes. The 
cytochrome P450 proteins are monooxygenases which catalyze many reactions involved in 
drug metabolism and synthesis of cholesterol, steroids and other lipids. The enzyme 
encoded by CYP1B1 gene localizes to the endoplasmic reticulum and metabolizes 
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procarcinogens such as polycyclic aromatic hydrocarbons and 17beta-estradiol. Mutations 
in this gene have been associated with primary congenital glaucoma. 
[0346] CPS 249 corresponds to HU-K5 (MGLL) which encodes monoglyceride 

lipase. This gene has LocusID: 11343, and is located on chromosome 3 with reported 
cytogenetic location 3q21.3. The gene resides in genomic locus NT_005588 (NCBI 
Genome Annotation). The gene product may function in regulating levels of 
lysophospholipids. , 

[0347] Affymetrix annotation suggests that CPS 250 corresponds to DORA. DORA 

is also known as IGSF6 which encodes immunoglobulin superfamily, member 6. The gene 
has LocusID: 10261, and is located at chromosome 16pl2-pl3. 

[0348] Blast search of the Entrez human genome sequence database shows that CPS 

250 has at least 97% sequence identity with an intron sequence of DREVL DREV1 
encodes CGI-81 protein. DREV1 has LocusID: 51108, and is located on chromosome 16 
with reported cytogenetic location 16pl3-pl2. The gene resides in genomic locus 
NTJH0441 (NCBI Genome Annotation). 

[0349] CPS 253 corresponds to MD-2 which encodes MD-2 protein. This gene has 

LocusID: 23643, and is located on chromosome 8 with reported cytogenetic location 
8ql3.2. The gene resides in genomic locus NTJJ08209 (NCBI Genome Annotation). The 
MD-2 protein appears to associate with toll-like receptor 4 on the cell surface and confer 
responsiveness to lipopolysaccyaride (LPS), thus providing a link between the receptor and 
LPS signaling. 

[0350] CPS 254 corresponds to UNKU07563 (RRP4) which encodes homolog of 

yeast RRP4 (ribosomal RNA processing 4), S'-S'-exqribonuclease. This gene has LocusID: 
23404, and is located on chromosome 9 with reported cytogenetic location 9q34. The gene 
resides in genomic locus NT_008338 (NCBI Genome Annotation). The gene product, also 
known as ribosomal RNA processing 4, is similar to S. cerevisiae RRP4 which is a 
component of both the nuclear and cytoplasmic forms of the ribosomal RNA processing 3'- 
5' exosome complex. UNKU07563 or RRP4 gene product may function in ribosomal 
RNA processing. 

[0351] Affymetrix annotation suggests that CPS 255 corresponds to SPINK1 which 

encodes serine protease inhibitor, Kazal type 1. The gene has LocusID: 6690 and reported 
cytogenetic location 5q32. 
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[0352] Blast search of the Entrez human genome sequence database shows that CPS 

255 has at least 97% sequence identity with a chromosome 5 region between LOC256135 
and KIAA0555. Both LOC256135 and KIAA0555 reside in genomic locus NT_006859 
(NCBI Genome Annotation). LOC256135 encodes a protein similar to N-formyl peptide 
receptor. KIAA0555 encodes KIAA0555 gene product, and has LocusID: 9832 and 
reported cytogenetic location 5q32. 

[0353] CPS 256 corresponds to UNK_S82470 (also known as BB1 or LENG4) 

which encodes leukocyte receptor cluster (LRC) member 4. This gene has LocusID: 79143, 
and is located on chromosome 19 with reported cytogenetic location 19ql3.4. The gene 
resides in genomic locus NT_01 1 148 (NCBI Genome Annotation). 

[0354] Affymetrix annotation suggests that CPS 257 corresponds to IGHG3. The 

gene encodes immunoglobulin heavy constant gamma 3 (G3m marker), and has LocusID: 
3502. The gene is located at chromosome 14q32.33. 

[0355] Blast search of the Entrez human genome sequence database shows that CPS 

257 has at least 96% sequence identity with a chromosome 14 region near LOC122595. 
LOC122595 has reported cytogenetic location 14q32.33 and resides in genomic locus 
NT_010168 (NCBI Genome Annotation). 

[0356] CPS 260 corresponds to TNFRSF12 which encodes tumor necrosis factor 

receptor superfamily, member 12 (translocating chain-association membrane protein). This 
gene has LocusID: 8718, and is located on chromosome 1 with reported cytogenetic 
location lp36.2. The gene resides in genomic locus NTJ328054 (NCBI Genome 
Annotation). The gene product contains a cytoplasmic death domain and transmembrane 
domains. It can induce apoptosis and activates NF-kappaB. 

[0357] CPS 261 corresponds to UNK_AL031685 (KIAA0939) which encodes 

KIAA0939 protein. This gene has LocusID: 23315, and is located on chromosome 20 with 
reported cytogenetic location 20ql3.13. The gene resides in genomic locus NT_011362 
(NCBI Genome Annotation). 

[0358] CPS 262 corresponds to UNK.AL049963 (LOC64116) which encodes a 

protein up-regulated by BCG-CWS. This gene has LocusID: 64116, and is located on 
chromosome 4 with reported cytogenetic location 4q22-q24. The gene resides in genomic 
locus NTJJ06383 (NCBI Genome Annotation). 
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[0359] Affymetrix annotation suggests that CPS 263 corresponds to APELIN which 

encodes apelin, a peptide ligand for APJ receptor. The gene has LocusID: 8862, and is 
located at chromosome Xq25-26.3. 

[0360] Blast search of the Entrez human genome sequence database shows that CPS 

263 aligns with OCRL with at least 97% sequence identity. OCRL refers to 
oculocerebrorenal syndrome of Lowe. This gene has LocusID: 4952, and is located on 
chromosome X with reported cytogenetic location Xq25-q2.6.1. Hie gene resides in 
genomic locus NT_0 11786 (NCBI Genome Annotation). Mutations in this gene are linked 
to the disease oculocerebrorenal syndrome of Lowe. The encoded protein is a 
phospatidylinositol polyphosphate 5-phosphatase that can be found in golgi cisternae. 
[0361] CPS 264 corresponds to VEGF which encodes vascular endothelial growth 

factor. This gene has LocusID: 7422, and is located on chromosome 6 with reported 
cytogenetic location 6pl2. The gene resides in genomic locus NT_007592 (NCBI Genome 
Annotation). Vascular endothelial growth factor can induce endothelial cell proliferation 
and vascular permeability. 

[0362] CPS 265 corresponds to ELAVL2 which encodes ELAV (embryonic lethal, 

abnormal vision, Drosophila)-like 2 (Hu antigen B). This gene has LocusID: 1993, and is 
located on chromosome 9 with reported cytogenetic location 9p21. The gene resides in 
genomic locus NT_023974 (NCBI Genome Annotation). The gene product can bind to 3 - 
untranslated regions of mRNA. 

[0363] CPS 266 corresponds to PPP2R2A which encodes protein phosphatase 2 

(formerly 2A), regulatory subunit B (PR 52), alpha isoform. This gene has LocusID: 5520, 
and is located on chromosome 8 with reported cytogenetic location 8p21.1. The gene 
resides in genomic locus NT_023666 (NCBI Genome Annotation). 

[0364] Nucleotides 163 to 505 of CPS 266 have 96% sequence identity with an 

intron sequence of TSC22. TSC22 encodes transforming growth factor beta-stimulated 
protein TSC-22. It has LocusID: 8848, and resides in genomic locus NT_033922 with 
reported cytogenetic location 13ql4. CPS 266 aligns with the non-protein-coding strand of 
the gene. 

[0365] CPS 267 corresponds to UNKAF001862 (FYB) which encodes FYN 

binding protein (FYB-120/130). This gene has LocusID: 2533, and is located on 
chromosome 5 with reported cytogenetic location 5pl3,L The gene resides in genomic 
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locus NT_023195 (NCBI Genome Annotation). FYN-binding protein can modulate 
interleukin 2 production. 

[03661 CPS 268 corresponds to OAS1 which encodes 2',5'-oligoadenylate 

synthetase 1 (40-46 kD). This gene has LocusID: 4938, and is located on chromosome 12 
with reported cytogenetic location 12q24.1. The gene resides in genomic locus NT_009770 
(NCBI Genome Annotation). This gene product is a member of the 2', 5' oligoadenylate 
synthase family. It can be induced by interferons and catalyze the 2', 5' oligomers of 
adenosine in order to bind and activate RNase L. This gene family plays a significant role 
in the inhibition of cellular protein synthesis and viral infection resistance. Alternative 
splicing of OAS1 gene produces at least two isoforms. 

[0367] CPS 269 corresponds to CYBB which encodes cytochrome b-245, beta 

polypeptide (chronic granulomatous disease). This gene has LocusID: 1536, and is located 
on chromosome X with reported cytogenetic location Xp21.1. The gene resides in genomic 
locus NT_011657 (NCBI Genome Annotation). Cytochrome b (-245) is composed of 
cytochrome b alpha (CYBA) and beta (CYBB) chain. It has been proposed as a primary 
component of the microbicidal oxidase system of phagocytes. CYBB deficiency is one of 
five described biochemical defects associated with chronic granulomatous disease (CGD). 
In this disorder, there is decreased activity of phagocyte NADPH oxidase. Neutrophils are 
able to phagocytize bacteria but cannot kill them in the phagocytic vacuoles. The cause of 
the killing defect may be an inability to increase the cell's respiration and consequent failure 
to deliver activated oxygen into the phagocytic vacuole. 

[0368J CPS 270 corresponds to GFPT2 which encodes glutamine-fructose-6- 

phosphate transaminase 2. This gene has LocusID: 9945, and is located on chromosome 5 
with reported cytogenetic location 5q34-q35. The gene resides in genomic locus 
NT_006519 (NCBI Genome Annotation). The encoded protein is an enzyme of the 
hexosamine biosynthetic pathway. 

[03691 Affymetrix annotation suggests that CPS 271 corresponds to BNIP3 which 
encodes BCL2/adenovirus E1B 19kDa interacting protein 3. The gene has LocusID: 664, 
and is located at chromosome 14ql 1 .2-ql2. 

[0370] Blast search oftheEntrez human genome sequence database shows that CPS 

271 has 100% sequence identity with an intron sequence of LOC159348. LOC159348 is 
located at chromosome 10q26.3 and has genomic locus NT_024040 (NCBI Genome 
Annotation). Nucleotides 1 to 451 of CPS 270 have 97% sequence identity with a region 3* 
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to the protein-coding sequence of LOC254590. LOC254590 resides in genomic locus 
NT_025892 on chromosome 14. In addition, nucleotides 4 to 450 align to a chromosome 
15 region near CAPN3 with 81% sequence identity. CAPN3 encodes calpain 3, (p94), and 
has LocusID: 825 and reported cytogenetic location 15ql5.1-q21.1. CAPN3 resides in 
genomic locus NT_03 0828 . 

[03711 CPS 272 corresponds to BST1 which encodes bone marrow stromal cell 

antigen 1. This gene has LocusID: 683, and is located on chromosome 4 with reported 
cytogenetic location 4pl5. The gene resides in genomic locus NT_006344 (NCBI Genome 
Annotation). Bone marrow stromal cell antigen 1 is a stromal cell line-derived 
glycosylphosphatidylinositol-anchored molecule that facilitates pre-B-cell growth. BST1 
expression can be enhanced in bone marrow stromal cell lines derived from patients with 
rheumatoid arthritis. The polyclonal B-cell abnormalities in rheumatoid arthritis may be, at 
least in part, attributed to BST1 over-expression in the stromal cell population. 
[0372] CPS 273 corresponds to UNKU50535 (CG005) which encodes hypothetical 

protein from BCRA2 jegion. This gene has LocusID: 10443, and is located on chromosome 
13 with reported cytogenetic location 13ql2-ql3. The gene resides in genomic locus 
NT_009984 (NCBI Genome Annotation). The gene product has a region of low similarity 
to a region of rat 2\3 '-cyclic nucleotide 3 '-phosphodiesterase 
~[0373] — ~CPS 274 corresponds to~CX3CRi which encodes chemokine (CX3C) 
receptor 1. This gene has LocusID: 1524, and is located on chromosome 3 with reported 
cytogenetic location 3p21.3. The gene resides in genomic locus NT_005498 (NCBI 
Genome Annotation). CX3C chemokine receptor is a G protein-coupled receptor. It can 
mediate leukocyte migration and adhesion, bind the CX3C chemokine fractalkine and signal 
through a pertussis toxin sensitive G-protein. 

[0374] CPS 275 corresponds to CKTSF1B1 which encodes cysteine knot 

superfamily 1, BMP antagonist 1. This gene has LocusID: 26585, and is located on 
chromosome 15 with reported cytogenetic location 15ql3-ql5. The gene resides in 
genomic locus NT_024680 (NCBI Genome Annotation). The gene product is a homolog of 
Xenopus laevis Gremlin which is a secreted protein that blocks signaling of bone 
morphogenetic protein (BMP) by preventing access to the BMP receptor. 
[0375] CPS 276 corresponds to VNN2 which encodes vanin 2. This gene has 

LocusID: 8875, and is located on chromosome 6 with reported cytogenetic location 6q23- 
q24. The gene resides in genomic locus NT_025741 (NCBI Genome Annotation). This 
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gene product is a member of the Vanin family of proteins which share sequence similarity 
with each other, and also with biotinidase. The family includes secreted and membrane- 
associated proteins, a few of which have been reported to participate in hematopoietic cell 
trafficking. Members of this family possess pantetheinase activity, and may play a role in 
oxidative-stress response. Vanin 2 is a GPI-anchored cell surface molecule that plays a role 
in transendothelial migration of neutrophils. VNN2 gene lies in close proximity to, and in 
same transcriptional orientation as two other vanin genes on chromosome 6q23-q24. Two 
transcript variants encoding different isoforms have<been described for this gene. 
[0376] CPS 277 corresponds to KIAA0064 (SNX17) which encodes sorting nexin 

17. This gene has LocusID: 9784, and is located on chromosome 2 with reported 
cytogenetic location 2p23-p22. The gene resides in genomic locus NT_005204 (NCBI 
Genome Annotation). 

[0377] CPS 278 corresponds to GROl which encodes GROl oncogene (melanoma 

growth stimulating activity, alpha). This gene has LocusID: 2919,. and is located on 
chromosome 4 with reported cytogenetic location 4q21. The gene resides in genomic locus 
OTT006216 (NCBI Genome Annotation). The gene product has melanoma growth 
stimulating activity. It may be a mitogenic factor involved in inflammatory processes. 
[0378] CPS 278 has 88% sequence identity with GR02 which encodes GR02 

oncogene. GR02 has LocusID: 2920, and resides in genomic locus NT_00621-6 with 
reported cytogenetic location 4q2 1 . 

[0379] CPS 279 corresponds to KIAA0655 (HIP12) which encodes huntingtin 

interacting protein 12. This gene has LocusID: 9026, and is located on chromosome 12 
with reported cytogenetic location 12q24. The gene resides in genomic locus NT_009464 
(NCBI Genome Annotation). 

[0380] Nucleotides 13 to 332 of CPS 279 aligns to an intron of DNAH11 with 100% 

sequence identity. DNAH11 encodes dynein, axonemal, heavy polypeptide 11. It has 
LocusID: 8701, and 

[0381] resides in genomic locus NT_007819 with reported cytogenetic location 7p21. CPS 
279 aligns with the non-protein-coding strand of DNAH11. 

[0382] CPS 280 corresponds to RBI which encodes retinoblastoma 1 (including 

osteosarcoma). This gene has LocusID: 5925, and is located on chromosome 13 with 
reported cytogenetic location 13ql4.2. The gene resides in genomic locus NT_033922 
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(NCBI Genome Annotation). Retinoblastoma protein 1 is a nuclear phosphoprotein with 
DNA binding activity. It can interact with histone deacetylase to repress transcription. 
[0383] CPS 281 corresponds to KIAA0073 which encodes KIAA0073 protein. This 

gene has LocusID: 23398, and is located on chromosome 5 with reported cytogenetic 
location 5ql2.3. The gene resides in genomic locus NT_006431 (NCBI Genome 
Annotation). 

[0384] CPS 282 corresponds to CAPN4 (CAPNS1) which encodes calpain, small 

subunit 1. This gene has LocusID: 826, and is located on chromosome 19 with reported 
cytogenetic location 19ql3.13. The gene resides in genomic locus NT_011296 (NCBI 
Genome Annotation). Calpains are a ubiquitous, well-conserved family of calcium- 
dependent, cysteine proteases. Calpain I and II are heterodimeric with distinct large 
subunits associated with common small subunits. CAPN4 (CAPNS1) gene encodes a small 
subunit common to both calpain I and II and is associated with myotonic dystrophy. 
[0385] CPS 284 corresponds to MGST1L1 (PTGES) which encodes prostaglandin E 

synthase. This gene, has LocusID: 9536, and is located on chromosome 9 with reported 
cytogenetic location 9q34.3. The gene resides in genomic locus NT_029366 (NCBI 
Genome Annotation). Prostaglandin (PG) E synthase is involved in eicosanoid and 
glutathione metabolism. It is a member of superfamily of membrane associated proteins. 
[0386] CPS 285 corresponds to~ClNH (SERPING1) which encodes serine (or 

cysteine) proteinase inhibitor, clade G (CI inhibitor), member 1, (angioedema, hereditary). 
This gene has LocusID: 710, and is located on chromosome 11 with reported cytogenetic 
location Ilql2-ql3.1. The gene resides in genomic locus NT_033903 (NCBI Genome 
Annotation). The protein encoded by C1NH gene can inhibit activated Clr and Cls of the 
first complement component and thus regulate complement activation. Deficiency of the 
encoded protein may be associated with hereditary angioneurotic oedema (HANE). The 
encoded protein is a member of the serine protease inhibitor (serpin) superfamily. 
[0387] Affymetrix annotation suggests that CPS 286 corresponds to H1FX. H1FX 

encodes HI histone family, member X. It has LocusID: 897 1 . 

[0388] CPS 287 corresponds to IL1B which encodes interleukin 1, beta. This gene 

has LocusID: 3553, and is located on chromosome 2 with reported cytogenetic location 
2ql4. The gene resides in genomic locus NT 022135 (NCBI Genome Annotation). 
Interleukin 1 beta may initiate and amplify the immune and inflammatory responses. 
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[0389] CPS 288 corresponds to F3 which encodes coagulation factor III 

(thromboplastin, tissue factor). This gene has LocusID: 2152, and is located on 
chromosome 1 with reported cytogenetic location Ip22-p21. The gene resides in genomic 
locus NT_021979 (NCBI Genome Annotation). Coagulation factor III which is a cell 
surface glycoprotein. This factor enables cells to initiate the blood coagulation cascades, 
and it functions as the high-affinity receptor for the coagulation factor VII. The resulting 
complex provides a catalytic event that is responsible for initiation of the coagulation 
protease cascades by specific limited proteolysis. There are at least three distinct domains 
of this factor: extracellular, transmembrane, and cytoplasmic. The factor functions in 
normal hemostasis, and is a component of the cellular immune response, 
[0390] CPS 289 corresponds to DUSP4 which encodes dual specificity phosphatase 

4, This gene has LocusID: 1 846, and is located on chromosome 8 with reported cytogenetic 
location 8pl2-pll. The gene resides in genomic locus NTJ)30743 (NCBI Genome 
Annotation). The protein encoded by this gene is a member of the dual specificity protein 
phosphatase subfamily. These phosphatases inactivate their target kinases by 
dephosphorylating both the phosphoserine/threonine and phosphotyrosine residues. They 
negatively regulate members of the mitogen-activated protein (MAP) kinase superfamily 
(MAPK/ERK, SAPK/JNK, p38), which are associated with cellular proliferation and 
differentiation. DUSP4 gene product can inactivate ERK1, ERK2 and JNK, is expressed in 
a variety of tissues, and is localized in the nucleus. Two alternatively spliced transcript 
variants, encoding distinct isoforms, have been observed for this gene. In addition, multiple 
polyadenylation sites have been reported. 
. [0391] Affymetrix annotation suggests that CPS 290 corresponds to HUMRTVLH3. 

HUMRTVLH3 encodes endogenous retroviral protease. The gene has LocusID: 51354. 
[0392] CPS 291 corresponds to UNK_AL049250 (BANP) which encodes BTG3 

associated nuclear protein. This gene has LocusID: 54971, and resides in genomic locus 
NTJH0859 on chromosome 18 (NCBI Genome Annotation). The gene product can 
interact with CAF1, a component of the general transcription multisubunit complex. It is 
thought that BTG3 is involved in negative control of the cell cycle. The protein encoded by 
BTG3 gene binds to BTG3. Studies with mouse homolog suggest that this protein may also 
interact with a specific nuclear matrix/scaffold-associated region (MAR). Transcript 
variants encoding different isoforms have been described for this gene. The alignment 
between CPS 291 and BANP overlaps LOC124302 which encodes a protein similar to 
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nuclear pore complex interacting protein. LOC124302 resides in genomic locus 
NT_0 1 0859 with reported cytogenetic location 1 8pl 1 . 1 . 

[0393] CPS 291 also aligns with other genes and chromosomal regions with at least 

97% sequence identity. These genes and chromosomal regions include LOC92267, 
LOC146181, LOC146136, KIAA0220, LOC253666, and an intron of LOC220S65, a region 
near LOC255019, a region near LOC83985, a region 5' to the protein-coding sequence of 
NPIP, and a region 3' to the protein-coding sequence of EIF3S8. LOC92267 encodes a 
protein similar to hypothetical protein FLJ12363. LOC92267 resides in NT_010441 with 
reported cytogenetic location I6pl2.1. CPS 291 aligns with the non-protein-coding strand 
of LOC92267. LOC146181 encodes a protein similar to Hypothetical protein KIAA0220, 
and resides in genomic locus. NT_0 10441 with reported cytogenetic location 16pl2.2. 
LOC146136 encodes a protein similar to nuclear pore complex interacting protein, and 
resides in genomic locus NT_010441 with reported cytogenetic location 16pl2.1. 
KIAA0220 has LocusID: 23117, and resides in genomic locus NT_035362 with reported 
cytogenetic location 16pl2.1. LOC253666 has similarity to rat kidney-specific (KS) gene. 
LOC253666 resides in genomic locus NT 010604 on chromosome 16. LOC220565 
encodes a protein similar to PI-3-kinase-related kinase SMG-1, isoform 1; lambda/iota 
protein kinase C-interacting protein; pho^hatidylmositol 3-kinase-related protein kinase, 
LOC220565 is located at chromosome 16ql3, and resides in genomic locus NT 033291. 
LOC255019 encodes a protein similar to nuclear pore complex interacting protein, and 
resides in genomic locus NT 035360 on chromosome 16. LOC83985 encodes spinster-like 
protein, and has LocusID: 83985^ LOC83985 resides in genomic locus NT_035372 with 
reported cytogenetic location 16ql3. NPIP (LocusID: 9284) encodes a nuclear pore 
complex interacting protein, and resides in genomic locus NT 035359 with reported 
cytogenetic location 16pl3-pll. EIF3S8 encodes eukaryotic translation initiation factor 3, 
subunit 8 (HOkD). EIF3S8 has LocusID: 8663, and resides in genomic locus NT_010589 
with reported cytogenetic location 16pll.2. LOC146452 encodes a protein similar to 
. KIAA0251 hypothetical protein. It resides in genomic locus NT_010478 with reported 
cytogenetic location 16q22.3. 

[0394] Nucleotides 1 to 196 of CPS 291 have about 86% sequence identity with 

LOCI 1 8735. LOCI 1 8735 encodes a protein similar to apoptosis response protein (prostate 
apoptosis response protein 4). LOCI 18735 resides in NT.030059 with reported 
cytogenetic location 10q24.2. 
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[0395] CPS 292 corresponds to UNK_AB002308 (KIAA0310) which encodes 

KIAA0310 gene product. This gene has LocusID: 9919, and is located on chromosome 9 
with reported cytogenetic location 9q34.3. The gene resides in genomic locus NT_033215 
(NCBI Genome Annotation), 

[0396] Asymetrix annotation suggests that CPS 293 corresponds to EDN1 which 

encodes endothelin 1. The gene has LocusID: 1906, and is located at chromosome 6p24.1. 
[0397] CPS 293 aligns with IL1B with at least 99% sequence identity. IL1B 

encodes interleukin 1 , beta* J 

[0398] CPS 295 corresponds to FRAT2 which encodes frequently rearranged in 

advanced T-cell lymphomas 2. This gene has LocusID: 23401, and is located on 
chromosome 10 with reported cytpgenetic location 10q23-q24.1. The gene resides in 
genomic locus NT_03005 (NCBI Genome Annotation). 

[0399] CPS 297 corresponds to IL8 which encodes interleukin 8. This gene has 

LocusID: 3576, and is located on chromosome 4 with reported cytogenetic location 4ql3- 
q2L The gene resides in genomic locus NT_006216 (NCBI Genome Annotation). 
Interleukin 8 is a cytokine that plays a role in chemoattraction and activation of neutrophils. 
It has similarity to several platelet-derived factors. 

[0400] Affymetrix annotation suggests that CPS 298 corresponds to FSCN2. 

FSCN2 encodes fascin homolog 2, actm-bundling protein, retinal (Strongylocentrotus 
purpuratus). The gene has LocusID: 25794, and is located at chromosome 17q25. 
[0401] CPS 300 corresponds to CAMP which encodes cathelicidin antimicrobial 

peptide. This gene has LocusID: 820, and is located on chromosome 3 with reported 
cytogenetic location 3p21.3. The gene resides in genomic locus NT_022567 (NCBI 
Genome Annotation). Cathelicidin antimicrobial peptide is a precursor of a peptide with 
antibacterial activity. 

[0402] CPS 301 corresponds to FCGR1A which encodes Fc fragment of IgG, high 

affinity la, receptor for (CD64). This gene has LocusID: 2209, and is located on 
chromosome 1 with reported cytogenetic location lq2L2-q21.3. The gene resides in 
genomic locus NT_032962 (NCBI Genome Annotation). The gene product, also known as 
Fc gamma RI, is a receptor for the Fc domain of IgG. It is a member of the 
immunoglobulin superfamily, and may have a role in immune response. 
[0403] CPS 302 corresponds to MAT1A which encodes methionine 

adenosyltransferase I, alpha. This gene has LocusID: 4143, and is located on chromosome 
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10 with reported cytogenetic location 10q22. The gene resides in genomic locus 
NT_033890 (NCBI Genome Annotation). Methionine adenosyltransferase I {alpha 
isoform) catalyzes the formation of S-adenosylmethionine from methionine and ATP. Both 
the heta and alpha isoforms may be encoded by MATIA. Methionine adenosyltransferase 
deficiency is known to be caused by recessive as well as dominant mutations, the latter 
identified in autosomal dominant persistant hypermethioninemia. 

[0404] CPS 303 corresponds to CYP4B1 which encodes cytochrome P450, 

subfamily IVB, polypeptide 1. This gene has LocusID: 1580, and is located on 
chromosome 1 with reported cytogenetic location Ip34-pl2. The gene resides in genomic 
locus NT_004386 (NCBI Genome Annotation). This gene product is a member of the 
cytochrome P450 heme-binding monooxygenase superfamily, and can metabolize steroids, 
fatty acids and xenobiotics. 

[0405] Affymetrix annotation suggests that CPS 304 corresponds to MUC3. MUC3 

is also known as MUC3A, and encodes mucin 3A, intestinal. The gene has LocusID: 4584 
and reported cytogenetic location 7q22. 

[0406] Blast search of the Entrez human genome sequence database shows that CPS 

304 aligns to regions 5* to the protein-coding strand of MUC3B with 80-81% sequence 
identity. MUC3B encodes mucin 3B. It has LocusID: 57876 and reported cytogenetic 
location 7q22. The gene resides in genomic locus NT_007933. 

[0407] CPS 305 corresponds to B7 which encodes B7 protein. This gene has 

LocusID: 10233, and is located on chromosome 12 with reported cytogenetic location 
12pl3. The gene resides in genomic locus NT_035206 (NCBI Genome Annotation). The 
gene product has similarity to the regulatory subunit of protein phosphatases. It contains 
leucine rich repeats, and may mediate protein-protein interactions. 

[0408] CPS 306 corresponds to SCYA20 which encodes small inducible cytokine 

subfamily A (Cys-Cys), member 20. This gene has LocusID: 6364, and is located on 
chromosome 2 with reported cytogenetic location 2q33-q37. The gene resides in genomic 
locus NT_005403 (NCBI Genome Annotation). The protein encoded by this gene is a 
chemotactic factor for lymphocytes. 

[0409] CPS 307 corresponds to IL1RN which encodes interleukin 1 receptor 

antagonist. This gene has LocusID: 3557, and is located on chromosome 2 with reported 
cytogenetic location 2ql4.2. The gene resides in genomic locus NT_02213 (NCBI Genome 
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Annotation). Interleukin 1 receptor antagonist binds to and inhibits the IL-1 receptor. It is a 
member of the interleukin- 1 (IL-1) family. 

[0410] CPS 308 corresponds to ZNF261 which encodes zinc finger protein 261. 

This gene has LocusID: 9203, and is located on chromosome X with reported cytogenetic 
location Xql3.1. The gene resides in genomic locus NTJH9696 (NCBI Genome 
Annotation). The gene product contains a putative zinc-binding motif (MYM). 
[0411] CPS 309 corresponds to LGALS2 which encodes lectin, galactoside-binding, 

soluble, 2 (galectin 2). This gene has LocusID: 3^957, and is located on chromosome 22 
with reported cytogenetic location 22ql3.1. The gene resides in genomic locus NT_01 1520 
(NCBI Genome Annotation). Galectin 2 is an actose-binding lectin involved in cell growth 
regulation. 

[0412] CPS 310 corresponds to TGN51 (TGOLN2) which encodes trans-golgi 

network protein 2. This gene has LocusID: 10618, and is located on chromosome 2 with 
reported cytogenetic location 2pll.2. The gene resides in genomic locus NT_015805 
(NCBI Genome Annotation). 

[0413] The biological mechanisms underlying the CCI-779 modulation of the 

expression levels of the CCI-779 activity genes have yet to be elucidated. Without being 
limited to any specific theory, the modulation may be attributed to the direct effect of CCI- 
779 on PBMCs or other blood cells. It may also be caused by the effect of CCI-779 on 
renal cell carcinoma tumors, which in turn induce the change of the gene expression profile 
in the peripheral blood cells. 

[0414] Some of the CCI-779 activity genes are RCC disease genes that are 

differentially expressed in PBMCs of RCC patients relative to tumor-free humans. These 
genes include, for example, TUBB, CTSL, SCYA2, PAI2, UNK.AI679353, SCYA7, 
IL1R1, THBS1, NCF1, ATP2B1, GR02, PRF1, DBP, FCGR3B, ADFP, GR03, C1QR, 
RNASE2, CBP2, HK3, FOS, PDI2, UNKU12471 (THBS1), EREG, SPP1, STX1A, 
TKTL1, DUX1, SPINK1, UNK_AL049963 (LOC64116), BNIP3, GROl, IL1B, F3, 
UNKAL049250, EDN1, FCGR1A, MUC3, B7, SCYA20, IL1RN, and ZNF261. Over the 
course of CCI-779 therapy, at least one subset of these genes, which are elevated or 
suppressed in RCC patients, return to the normal baseline levels. 

[0415] Table 6 shows examples of RCC disease genes whose expression profiles in 

PBMCs can be modulated by CCI-779. Modulation of the expression profiles of these 
genes in ex vivo conditions is also indicated. 
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[0416] In one experiment, peripheral blood mononuclear cells isolated from tumor- 

free humans were treated with CCI-779 ex vivo. Comparison of the genes sensitive to CCI- 
779 treatment ex vivo with the CCI-779 activity genes of the present invention revealed a 
common set of genes. These genes represent surrogate markers of CCI-779 drug activity in 
vivo as well as ex vivo. 

[0417] As appreciated by those skilled in the art, the above-described methodology 

can be employed to identify genes whose expression profiles in PBMCs can be modulated 
by other drugs. 
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Ex Vivo Description 


2-fold downregulated 
by 300 nM CCI alone 
@24h 


2-fold downregulated 
by 300 nMCCI alone 
@24h AND 2-fold 
induced by PHA, 2-fold 
downregulated by 300 
nMCCI 


2-fold downregulated 
by 300 nM CCI alone 
@24h AND 2-fold 
induced by PHA, 2-fold 
downregulated by 300 
nMCCI 


2-fold downregulated 
by 300 nMCCI alone 
&24h 


2-fold downregulated 
by 300 nM CCI alone 
@24h 


2 fold induced by PHA, 
2-fold downregulated 
by 300 nMCCI 


2-fold downregulated 
by 300 nM CCI alone 
@24h AND 2-fold 
induced by PHA, 2-fold 
downregulated by 300 
nMCCI 


2-fold downregulated 
by 300 nM CCI alone 
@24h 


RCC 16wks 
with CCI- 
779 Average 


57.24 


59.90 


33.95 


1 

11.67 


19.95 


28.38 

■ 


36.29 


23.52 


RCC 8wks 
with Ca-779 
Average 
(n=21) 


36.24 


39.62 


160.10 


45.10 


13.95 


89.29 


DO 
OO 


. 29.67 


RCC 
Baseline 
Average 


20.81 


34.14 


287.95 


66.81 


8.24 


172.52 


83.90 


27.95 


RCC-Free 
Average 


11.30 


18.25 


84.55 


19.55 


4.05 


38.45 


16.60 


5.65 


Gene Description 


ribonuclease, RNase A family, 
2 (liver, eosinophil-derived 
neurotoxin) 


leukocyte immunoglobulin-like 
receptor, subfamily B (with 
TM and ITIM domains), 
member 3 


plasminogen activator 
inhibitor, type H (arginine- 
serpin) 


GR02 oncogene 


Fc fragment of IgG, high 
affinity la, receptor for (CD64) 


cathepsinL 


interleukin 1 receptor 
antagonist 


Hofno sapiens mRNA; cDNA 
DKFZp566H033 (from clone 
DKFZp566H033) 


Entrez Accession 


X55988 


AF025533 


Y00630 


M36820 


M63835 


X12451 


X52015 


AL096744 


Gene Name 


RNASE2 


LELRB3 


PAI2 
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O 


FCGRIA 


CTSL 


ILIRN 
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oo 



TUBB 


SCYA2 


GROl 


SCYA20 


SELP 


1 

NCFl 


SCYA7 


FABP5 


p 

I—* 


Gene Name 


X79535 


M26683 


X54489 


U64197 


M25322 


M55067 


X72308 
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C. Monitoring CCI-779 Drug Activities 

[0418] The CCI-779 activity genes identified in the present invention -can be used to 

monitor CCI-779 drug activities in a patient who is subject to a CCI-779 treatment. 
Peripheral blood samples can be isolated at different stages of the CCI-779 treatment. The 
expression profile of one or more CCI-779 activity genes in these peripheral blood samples 
can be determined and compared to a reference expression profile. A change in the gene 
expression profile indicates in vivo activities of CCI-779. In one embodiment, the patent 
has RCC or another solid tumor. 

[0419] Numerous methods are available for detecting gene expression profiles. In 

one aspect, the expression profiles of CCI-779 activity genes are determined by measuring 
the levels of RNA transcripts of these genes in peripheral blood samples. Suitable methods 
for this purpose include, but are not limited to, RT-PCT, Northern Blot, in situ 
hybridization, slot-blotting, nuclease protection assay, and nucleic acid arrays. The 
peripheral blood samples can be, without limitation, whole blood samples or samples 
containing enriched PBMCs. 

[0420] In one embodiment, RNA isolated from peripheral blood samples can be first 

amplified to cDNA or cRNA. The amplification can be specific or non-specific. Suitable 
amplification methods include, but are not limited to, reverse transcriptase PCR, isothermal 
amplification, ligase chain reaction, and Qbeta replicase. The amplified nucleic acid 
products can be detected or quantitated, for instance, through hybridization to labeled 
probes. 

[0421] Amplification primers and hybridization probes for a CCI-779 activity gene 

can be prepared from the gene sequence or its corresponding CPS using numerous methods. 
Gene sequences suitable for this purpose include, but are not limited to, exons, introns, or 
the 3* or 5' untranslated regions, or any combination thereof. In one embodiment, 
probes/primers are designed based on the sequence in or near the 3' protein-coding region 
of a CCI-779 activity gene. For instance, the nucleotide sequence encoding the last 100 to 
300 amino acid residues in the C-terminus region of the CCI-779 activity gene product can 
be selected to design probes or primers. Where a CCI-779 activity gene is a hypothetical or 
putative gene whose expression is supported only by EST or mRNA data, or where the 
genomic location(s) of a CCI-779 activity gene has not been determined or the gene may 
correspond to multiple genomic loci, the probes/primers for the gene can be designed based 
on the corresponding CPS, or the oligonucleotide probes of the corresponding qualifier. 
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Table 5 lists example sequences that are usefiil for designing probes/primers for detecting 
the expression profiles of CCI-779 activity genes. 

[0422] The length of the probes/primers can be selected to achieve the desired 

hybridization or amplification effect. For instance, each probe can comprise at least 15, 20, 
25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 400 or more nucleotides. 
In one embodiment, each probe/primer has relatively high sequence complexity and does 
not have any ambiguous residue (undetermined "n" residues). In another embodiment, the 
probes/primers can hybridize to the target gene! such as its RNA transcripts or the 
complements thereof, under stringent or highly stringent conditions. 

[0423] As used herein, "stringent conditions" are at least as stringent as, for 

example, conditions G-L : shown.in Table 7. "Highly stringent conditions" are at least as 
stringent as conditions A-F shown in Table 7. As used in Table 7, hybridization is carried 
out under the hybridization conditions (Hybridization Temperature and Buffer) for about 
four hours, followed by two 20-minute washes under the corresponding wash conditions 
(Wash Temp, and Buffer). 

Table 7. Stringency Conditions 



Stringency 
Condition 


Poly- 
nucleotide 
Hybrid 


Hybrid 
Length (bp) 1 


Hybridization 
Temperature and Buffer 11 


Wash Temp, 
and Buffer 11 


A 


DNA:DNA 


>50 


65°C; lxSSC -or- 

42°C: lxSSC. 50% formamide 


65°C; 0.3xSSC 


B 


DNADNA 


<50 


T B *; lxSSC 


Tb*; lxSSC 


C 


DNA:RNA 


F>50 


67°C; lxSSC -or- 

45°C; lxSSC. 50% formamide 


67°C; 0.3xSSC 


D 


DNA:RNA 


K50 


Tn*; lxSSC 


T n *; lxSSC 


E 


RNA: RNA 


>50 


70°C; lxSSC -or- 

50°C: lxSSC, 50% formamide 


70°C; 0.3xSSC 


F 


RNA:RNA 


<50 


T F *; lxSSC 


Tf*; lxSSC 


G 


DNA:DNA 


>50 


65°C; 4xSSC -or- 

42°C; 4xSSC, 50% formamide 


65°C; lxSSC 


H 


DNA:DNA 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


I 


DNA:RNA 


>50 


67°C; 4xSSC -or- 

45°C; 4xSSC, 50% formamide 


67°C; lxSSC 


J 


DNArRNA 


<50 


Tj*; 4xSSC 


Tj*; 4xSSC 


K 


RNA: RNA 


>50 


70°C; 4xSSC -or- 

50°C; 4xSSC, 50% formamide 


67°C; lxSSC 


L 


RNA:RNA 


<50 


r L *; 2xSSC 


Ti *; 2xSSC 



1: The hybrid length is that anticipated for the hybridized r«gion(s) of the 
hybridizing polynucleotides. When hybridizing a polynucleotide to a target polynucleotide 
of unknown sequence, the hybrid length is assumed to be that of the hybridizing 
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polynucleotide. When polynucleotides of known sequence are hybridized, the hybrid length 
can be determined by aligning the sequences of the polynucleotides and identifying the 
region or regions of optimal sequence complementarity. 

H : SSPE (lxSSPE is 0.15M NaCl, lOmM NaH 2 P0 4 , and 1.25mM EDTA, pH 7.4) 
can be substituted for SSC (lxSSC is 0.15M NaCl and 15mM sodium citrate) in the 
hybridization and wash buffers, 

Tb* - Tr*: The hybridization temperature for hybrids anticipated to be less than 50 
base pairs in length should be 5-1 0°C less than the melting temperature (T m ) of the hybrid, 
where T m is determined according to the following equations. For hybrids less than 18 base 
pairs in length, T m (°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 
18 and 49 base pairs in length, T m (°C) = 81.5 + le^OogioNa*) + 0.41(%G + C) - (600/N), 
where N is the number of bases in the hybrid, and Na + is the molar -concentration of -sodium 
ions in the hybridization buffer (Na + for lxSSC = 0.165M). 

[0424] In one embodiment, the probes/primers for a CCI-779 activity gene are 

- selected from regions which significantly diverge from the sequences of other genes- Such 
regions can be determined by checking the probe/primer sequences against a human 
genome sequence database, such as the Entrez database at the NCBI. One algorithm 
suitable for this purpose is the BLAST algorithm. This algorithm involves first identifying 
high scoring sequence pairs (HSPs) by identifying short words of length W in the query 
sequence, which either match or satisfy some positive-valued threshold score T when 
aligned with a word of the same length in a database sequence. T is referred to as the 
neighborhood word score threshold. These initial neighborhood word hits act as ^eeds for 
initiating searches to find longer HSPs containing them. The word hits are then extended in 
both directions along each sequence to increase the cumulative alignment score. 
Cumulative scores are calculated using, for nucleotide sequences* the parameters M {reward 
score for a pair of matching residues; always >0) and N (penalty score for mismatching 
residues; always <0). The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. These parameters can be adjusted for different 
purposes, as appreciated by one of ordinary skill in the art. 

[0425] In one aspect, quantitative RT-PCR (such as TaqMan, ABI) is used for 

detecting and comparing the peripheral blood expression profiles of CCI-779 activity genes. 
Quantitative RT-PCR involves reverse transcription (RT) of RNA to cDNA followed by 
relative quantitative PGR (RT-PCR). 

[0426] In PCR, the number of molecules of the amplified target DNA increases by a 

factor approaching two with every cycle of the reaction until some reagent becomes 
limiting. Thereafter, the rate of amplification becomes increasingly diminished until there 
is not an increase in the amplified target between cycles. If one plots a graph on which the 
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cycle number is on the X axis and the log of the concentration of the amplified target DNA 
is on the Y axis, one observes that a curved line of characteristic shape is formed by 
connecting the plotted points. Beginning with the first cycle, the slope of the line is positive 
and constant. This is said to be the linear portion of the curve. After some reagent becomes 
limiting, the slope of the line begins to decrease and eventually becomes zero. At this point 
the concentration of the amplified target DNA becomes asymptotic to some fixed value. 
This is said to be the plateau portion of the curve, j 

[0427] The concentration of the target DNA in the linear portion of the PCR is 

proportional to the starting concentration of the target before the PCR was begun. By 
determining the concentration of the PCR products of the target DNA in PCR reactions that 
have completed the same number of cycles and are in their liij^ranges, c it^is jpsable to 
determine the relative concentrations of the specific target sequence in the original DNA 
mixture. If the DNA mixtures are cDNAs synthesized from RNAs isolated from different 
tissues or cells, the relative abundances of the specific mRNA from which the target 
sequence was derived may be determined for the respective tissues or cells. This direct 
proportionality between the concentration of the PCR products and the relative mRNA 
abundances is true in the linear range portion of the PCR reaction. 

[0428] The final concentration of the target DNA in the plateau portion of the curve 

is determined by the availability of reagents in the reaction mix and is independent of the 
original concentration of target DNA. Therefore, the sampling and quantifying of the 
amplified PCR products can be carried out when the PCR reactions are in the linear portion 
of their curves. In addition, relative concentrations of the amplifiable cDNAs can be 
normalized to some independent standard, which may be based on either internally existing 
RNA species or externally introduced RNA species. The abundance of a particular mRNA 
species may also be determined relative to the average abundance of all mRNA species in 
the sample. 

[0429] In one embodiment, the PCR amplification utilizes internal PCR standards 

that are approximately as abundant as the target. This strategy is effective if the products of 
the PCR amplifications are sampled during their linear phases. If the products are sampled 
when the reactions are approaching the plateau phase, then the less abundant product may 
become relatively over-represented. Comparisons of relative abundances made for many 
different RNA samples, such as is the case when examining RNA samples for differential 
expression, may become distorted in such a way as to make differences in relative 
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abundances of KNAs appear less than they actually are. This can be improved if the 
internal standard is much more abundant than the target. If the internal standard is more 
abundant than the target, then direct linear comparisons may be made between RNA 
samples. 

[0430] A problem inherent in clinical samples is that they are of variable quantity or 

quality. This problem can be overcome if the RT-PCR is performed as a relative 
quantitative RT-PCR with an internal standard in which the internal standard is an 
amplifiable cDNA fragment that is larger than the target cDNA fragment and in which the 
abundance of the mRNA encoding the internal standard is roughly 5-100 times higher than 
the mRNA encoding the target. This assay measures relative abundance, not absolute 
abundance of the respective mRNA species... «-^—- -= ....... 

[0431] In another embodiment, the relative quantitative RT-PCR uses an external 

standard protocol. Under this protocol, the PCR products are sampled in the linear portion 
of their amplification curves. The number of PCR cycles that are optimal for sampling can. 
be empirically determined for each target cDNA fragment In addition, the reverse 
transcriptase products of each RNA population isolated from the various samples can be 
normalized for equal concentrations of amplifiable cDNAs. While empirical determination 
_of-the-linear-range_of_the amplification curve and normalization of cD NA pre parations are 



tedious and time-consuming processes, the resulting RT-PCR assays may, in certain cases, 
be superior to those derived from a relative quantitative RT-PCR with an internal standard. 
[0432] Nucleic acid arrays can also be used to detect and compare the expression 

patterns of CCI-779 activity genes in peripheral blood samples isolated at different CCI-779 
treatment stages. Probes suitable for detecting the CCI-779 activity genes can be stably 
attached to known discrete regions on a support substrate, these probes maintain then- 
positions relative to the respective discrete regions during hybridization and subsequent 
washes. Construction of nucleic acid arrays is well known in the art. Suitable substrates for 
making nucleic acid arrays include, but are not limited to, glasses, silica, ceramics, nylons, 
quartz wafers, gels, metals, papers, beads, tubes, fibers, films, membranes, column matrixes, 
or microtiter plate wells. 

[0433] A nucleic acid array of the present invention can comprise at least 2, 5, 10, 

15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 150, 200, 250, or more different 
polynucleotide probes, each different probe capable of hybridizing to a different respective 
CCI-779 activity gene. Multiple probes for the same gene can be used on a single nucleic 
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acid array. Examples of probes suitable for this invention are listed in the Qualifier Table. 
Probes for other drug activities genes can also be included in the nucleic acid arrays of this 
invention. The probe density on the array can be in any range. For instance, the density 
may be 50, 1 00, 200, 300, 400, 500, or more probes/cm 2 . 

[0434] In one embodiment, a substantial portion of all polynucleotide probes on a 

nucleic acid array of the present invention are probes for CCI-779 or other drug activity 
genes. For instance, at least 10%, 20%, 30%, 40%, 50%, or more of all probes on the 
nucleic acid array can hybridize under stringent or nucleic acid array hybridization 
conditions to RNA transcripts, or the complements thereof, of drag activity genes. 
[0435] In another embodiment, nuclease protection assays are used to quantify 

RNAs derived from the peripheral blood samples. There are many different versions of 
nuclease protection assays. The common characteristic of these nuclease protection assays 
is that they involve hybridization of an antisense nucleic acid with the RNA to be 
quantified. The resulting, hybrid double-stranded molecule is then treated with a nuclease 
which digests single-stranded nucleic acids more efficiently than double-stranded 
molecules. The amount of antisense nucleic acid that survives digestion is a measure of the 
amount of the target RNA species to be quantified. An example of nuclease protection 
assays is the RNase protection assay manufactured by Ambion, Inc. (Austin, Texas). 
[0436] In another aspect, the peripheral blood expression profiles of CCI-779 

activity genes are determined by measuring the levels of polypeptides encoded by these 
genes. Methods suitable for this purpose include, but are not limited to, immunoassays such 
as ELISA, RIA, FACS, dot blot, Western Blot, immunohistochemistry, and antibody-based 
radioimaging. Other methods such as 2-dimensional SDS-polyacrylamide gel 
electrophoresis can also be used. 

[0437] One exemplary method suitable for detecting the levels of target proteins in 

peripheral blood samples is ELISA. In an exemplifying ELISA, antibodies capable of 
binding to the target proteins encoded by one or more CCI-779 activity genes are 
immobilized onto a selected surface exhibiting protein affinity, such as wells in a 
polystyrene or polyvinyJchloride microliter plate. Then, peripheral blood samples to be 
tested are added to the wells. After binding and washing to remove non-specifically bound 
immunocomplexes, the bound antigen(s) can be detected. Detection can be achieved by the 
addition of a second antibody which is specific for the target proteins and is linked to a 
detectable label. Detection may also be achieved by the addition of a -second antibody, 
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followed by the addition of a third antibody that has binding affinity for the second 
antibody, with the third antibody being linked to a detectable label. Before being added to 
the microtiter plate, cells in the peripheral blood samples can be lysed using various 
methods known in the art. Proper extraction procedures can be used to separate the target 
proteins from potentially interfering substances. 

[0438] In another exemplifying ELISA, the peripheral blood samples suspected of 

containing the target proteins are immobilized onto the well surface and then contacted with 
the antibodies of the invention. After binding and washing to remove non-specifically 
bound immunocomplexes, the bound antigen is detected. Where the initial antibodies are 
linked to a detectable label, the immunocomplexes can be detected directly. The 
immunocomplexes can also be detected using a second antibody that has binding affinity for 
the first antibody, with the second antibody being linked to a detectable label. 
[0439] Another exemplary ELISA involves the use of antibody competition in the 

detection. In this ELISA, the target proteins are immobilized on the well surface. The 
labeled antibodies are. added to the well, allowed to bind to the target proteins, and detected 
by means of their labels. The amount of the target proteins in an unknown sample is then 
determined by mixing the sample with the labeled antibodies before or during incubation 
with coated wells. The presence o f the targ et proteins in the unknown sample acts to reduce 
the amount of antibody available for binding to the well and thus reduces the ultimate 
signal. 

[0440] Different ELISA formats can have certain features in common, such as 

coating, incubating or binding, washing to remove non-specifically bound species, and 
detecting the bound immunocomplexes. For instance, in coating a plate with either antigen 
or antibody, the wells of the plate can be incubated with a solution of the antigen or 
antibody, either overnight or for a specified period of hours. The wells of the plate are then 
washed to remove incompletely adsorbed material. Any remaining available surfaces of the 
wells are then "coated" with a nonspecific protein that is antigenically neutral with regard to 
the test samples. Examples of these nonspecific proteins include bovine serum albumin 
(BSA), casein and solutions of milk powder. The coating allows for blocking of 
nonspecific adsorption sites on the immobilizing surface and thus reduces the background 
caused by nonspecific binding of antisera onto the surface. 

[0441] In ELISAs, a secondary or tertiary detection means ^an also be used. After 

binding of a protein or antibody to the well, coating with a non-reactive material to reduce 
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background, and washing to remove unbound material, the immobilizing surface is 
contacted with the control or clinical or biological sample to be tested under conditions 
effective to allow immunocomplex (antigen/antibody) formation. These conditions may 
include, for example, diluting the antigens and antibodies with solutions such as BSA, 
bovine gamma globulin (BGG) and phosphate buffered saline (PBS)/Tween and incubating 
the antibodies and antigens at room temperature for about 1 to 4 hours or at 4°C overnight. 
Detection of the immunocomplex then requires a labeled secondary binding ligand or 
antibody, or a secondary binding ligand or antibody in conjunction with a labeled tertiary 
antibody or third binding ligand. 

[0442] Following all incubation steps in an ELISA, the contacted surface can be 

washed so as to remove non-complexed material. For instance, the surface may be washed 
with a solution such as PBS/Tween, or borate buffer. Following the formation of specific 
immunocomplexes between the test sample and the originally bound material, and 
subsequent washing, the occurrence of the amount of immunocomplexes . can be determined. 
[0443] To provide a detection means, the second or third antibody can have an 

associated label. In one embodiment, the label is an enzyme that generates color 
development upon incubating with an appropriate chromogenic substrate. Thus, for 
example, one may contact and incubate the first or second immunocomplex with a urease, 
glucose oxidase, alkaline phosphatase or hydrogen peroxidase-conjugated antibody for a 
period of time and under conditions that favor the development of further immunpcomplex 
formation (e.g., incubation for 2 hours at room temperature in a PBS-containing solution 
such as PBS-Tween). 

[0444] After incubation with the labeled antibody, and subsequent to washing to 

remove unbound material, the amount of label is quantified, e.g., by incubation with a 
chromogenic substrate such as urea and bromocresol purple or 2,2'-azido-di-(3-ethyl> 
benzmiazoline-6-sulfonic acid (ABTS) and H 2 0 2 , in the case of peroxidase as the enzyme 
label. Quantitation can be achieved by measuring the degree of color generation, e.g., using 
a spectrophotometer. 

[0445] Another method suitable for this invention is RIA (radioimmunoassay). An 

exemplary RIA is based on the competition between radiolabeled-polypeptides and 
unlabeled polypeptides for binding to a limited quantity of antibodies. Suitable radiolabels 
include, but are not limited to, I 125 . In one embodiment, a fixed concentration of I 125 -labeled 
polypeptide is incubated with a series of dilution of an antibody specific to the polypeptide. 
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When the unlabeled polypeptide is added to the system, the amount of the I -polypeptide 
that binds to the antibody is decreased. A standard curve can therefore be constructed to 
represent the amount of antibody-bound I 125 -polypeptide as a function of the concentration 
of the unlabeled polypeptide. From this standard curve, the concentration of the 
polypeptide in unknown samples can be determined. 

[0446] Suitable antibodies for this invention include, but are not limited to, 

polyclonal antibodies, monoclonal antibodies, chimeric antibodies, humanized antibodies, 
single chain antibodies, Fab fragments, or fragments produced by a Fab expression library. 
Methods for making these antibodies are well known in the art. 

[0447] In one embodiment, the antibodies of the present invention can bind to the 

corresponding CCI-779 activity gene products or other desired antigens with a binding 
affinity constant of at least 10 4 M'\ 10 5 M~\ 10 6 M" 1 , 10 7 M"\ 10 8 NT 1 , or more. 
[0448] The antibodies of this invention can be labeled with one or more detectable 

moieties to allow for detection of antibody-antigen complexes. The detectable moieties can 
include compositions, detectable by spectroscopic, enzymatic, photochemical, biochemical, 
bioelectronic, immunochemical, electrical, optical or chemical means. The detectable 
moieties include, but are not limited, to, radioisotopes, chemiluminescent compounds, 
labeled binding proteins, heavy metal atoms, spectroscopic markers such as fluorescent 
markers and dyes, magnetic labels, linked enzymes, mass spectrometry tags, spin labels, 
electron transfer donors and acceptors, and the like. 

[0449] Moreover, the levels of polypeptides in peripheral blood samples can be 

determined by detecting the biological activities associated with the polypeptides.- If a 
biological function/activity of a polypeptide is known, suitable in vitro bioassays can be 
designed to evaluate the biological function/activity, which in turn can be used to determine 
the amount of the polypeptide in the sample. 

[0450] Comparison between the expression profile of a patient of interest and a 

reference expression profile can be conducted manually or electronically. The reference 
expression profile can be a baseline expression profile representing gene expression in 
peripheral blood samples isolated prior to a drug treatment The reference profile can also 
be an expression profile in peripheral blood samples isolated after initiation of the drug 
treatment The reference expression profile can be determine using sample isolated from 
the patient of interest or other reference patient or patients. In many embodiments, the 
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process or methodology that is used to determine the reference expression profile and the 
expression profile being compared is identical or comparable. 

[0451] In one example, comparison is carried out by comparing each component in 

the expression profile of the patient of interest to the corresponding component in the 
reference expression profile(s). The component can be the expression level of a drug 
activity gene, a ratio between the expression levels of two drug activity genes, or another 
measure capable of representing gene expression patterns. The expression level of a gene 
can be an absolute level, or a normalized or relative level. The difference between two 
corresponding components can be assessed by fold changes, absolute differences, or other 
suitable means. 

[0452] Comparison between expression profiles can also be conducted using pattern 

recognition or comparison programs. In addition, the serial analysis of gene expression 
(SAGE) technology, the GEMTOOLS gene expression analysis program (Incyte 
Pharmaceuticals), the GeneCalling and Quantitative Expression Analysis technology 
(Curagen), and other suitable methods, programs or systems can be used. 
[0453] Multiple drug activity genes can be used in the comparison of expression 

profiles. For instance, 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 30, 40, 50, or more drug activity 
genes can be used. 



[0454] In one embodiment, the drug activity gene(s) used in the comparison can be 

selected to have relatively small p-values or large differential expression ratios. In one 
example, the drug activity genes used in the comparison have p-values (e.g., under an 
ANOVA analysis) of no greater than 0.05, 0.01, 0.001, 0.0005, 0.0001, or less. In another 
example, the expression level after or during a drug treatment is increased or decreased by 
at least 2-fold, 3-fold, 4-fold, 5-fold, or more over the baseline expression level. 
[0455] In another embodiment, comparison of the expression profiles is performed 

electronically, such as by using a computer system. The computer system includes a 
processor coupled to a memory or storage medium which stores data representing the 
expression profiles being compared. In one example, the memory or storage medium is 
readable or rewritable. The stored expression data can be changed, retrieved, or otherwise 
manipulated. The memory can also store one or more programs capable of causing the 
processor to compare the expression profiles. In one embodiment, the processor is coupled 
to a nucleic acid array scanner which sends signals to the processor for analysis. 
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10456] In vivo activities of other drugs can be similarly monitored using toe drug 

activities genes of the present invention, as appreciated by those skilled in the art. 
[04571 At least a subset of the drug activity genes of the present invention may be 

used as disease regression indicators. This subset of genes may be responsive to the effect 
of CCI-779 or other drugs on RCC or other diseases. In one example, CCI-779 or another 
drug may cause regression of tumors, and the decrease in tumor size or activity may cause a 
decreased immune response which is reflected in gene expression changes in the peripheral 
blood cells. Consequently, the drug activity genes of the present invention can be used as 
molecular markers for monitoring the efficacy of CCI-779 or other drugs for treating RCC 
or other diseases. In many cases, diseases amendable to the present invention include those 
whose regression can produce changed immune responses. 

[04581 It should be understood that the above-described embodiments and the 

following examples are given by way of illustration, not limitation. Various changes and 
modifications within the scope of the present invention will become apparent to those 
skilled in the art from the present description. 

E. Examples 

Exam ple 1. Isolation of KNA and Prepara tion of Labeled Microarray Targets 

[0459J PBMCs from the CCI-779 clinical trials were isolated from whole blood 

samples (8mL) and collected into CPT tubes according to toe standard procedure. PBMCs 
were purified over Ficoll gradients, washed two times With PBS and counted. Total RNA 
was isolated from PBMC pellets using the RNeasy mini kit (Qiagen, Valencia, CA). 
Labeled target for oligonucleotide arrays was prepared using a modification of I the 
procedure described in Lockhart, et al, Nature Biotechnology 14: 1675-80 (1996). 2 ng 
total RNA was converted to cDNA by priming with an oligo-dT primer containing a T7 
DNA polymerase promoter at the 5' end. The cDNA was used as the template for in vitro 
transcription using a T7 DNA polymerase kit (Ambion, Woodlands, TX) and biotinylated 
CTP and UTP (Enzo). Labeled cRNA was fragmented in 40 mM Tris-acetate pH 8.0, 100 
mM KOAc, 30 mM MgOAc for 35 minutes at 94°C in a final volume of 40 jiL 
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[0460] Individual samples were hybridized to HgU95A genechip (Asymetrix). No 

samples were pooled. 1 10 RCC patients were involved in the study. Each patient received 
25, 75, or 250 mg of CCI-779 once weekly through intravenous infusion. Blood samples 
were collected immediately before the first CCI-779 infusion, eight weeks after the first 
infusion, and sixteen weeks after the first infusion. 

[0461] 10 ^g of labeled target was diluted in.l x MES buffer with 100 ng/ml herring 

sperm DNA and 50 jig/ml acetylated BSA. To normalize arrays to each other and to 
estimate the sensitivity of the oligonucleotide arrays, in vitro synthesized transcripts of 11 
bacterial genes were included in each hybridization reaction as described in Hill et al, 
JScience, 290: 809-812 (2000). The abundance of these transcripts ranged from 1:300,000 
(3 ppm) to 1:1000 (1000 ppm) stated in terms of the number of control transcripts per total 
transcripts. As determined by the signal response from these control transcripts, the 
sensitivity of detection of the arrays ranged between about 1:300,000 and 1:100,000 
copies/million. Labeled probes were denatured at 99°C for 5 minutes and then 45°C for 5 
minutes, and hybridized to oligonucleotide arrays comprised of over 12,500 human gene 
probes (HgU95A, Affymetrix). Arrays were hybridized for 16 hours at 45°C. The 
hybridization buffer was comprised of 100 mM MES, 1 M [Na*], 20 mM EDTA, and 0.01% 
Tween 20. After hybridization, the cartridges were washed extensively with non-stringent 
wash buffer (6 x SSPET), such as three 10-minute washes at room temperature. These 
hybridization and wash conditions are herein collectively referred to as "nucleic acid array 
hybridization conditions/* The washed cartridges were then stained with phycoerythrin 
coupled to streptavidin. 

[0462] 12 x MES stock contains L22 M MES and 0.89 M [Na 4 ]. For 1000 ml, the 

stock can be prepared by mixing 70.4 g MES free acid monohydrate, 193.3 g MES sodium 
salt and 800 ml of molecular biology grade water, and adjusting volume to 1000 ml. The 
pH should be between 6.5 and 6.7. 2 x hybridization buffer can be prepared by mixing 8.3 
mL of 12 x MES stock, 17.7 mL of 5 M NaCl, 4.0 mL of 0.5 M EDTA, 0.1 mL of 10% 
Tween 20 and 19.9 mL of water. 6 x SSPET contains 0.9 M NaCl, 60 mM NaH 2 P0 4 , 6 mM 
EDTA, pH 7.4, and 0.005% Triton X-100. In some cases, the wash buffer can be replaced 
with a more stringent wash buffer. 1000 ml stringent wash buffer can be prepared by 
mixing 83.3 mL of 12 x MES stock, 5.2 mL of 5 M NaCl, 1.0 mL of 10% Tween 20 and 
910.5 mL of water. 
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Example 3. flene Expr ession Data Analysis 

10463] Data analysis was performed on raw fluorescent intensity values using 

GENECHIP 3.2 software (Affymetrix). GeneChip 3.2 software uses an algorithm to 
calculate the likelihood as to whether a gene is "absent" or "present" as well as a specific 
hybridization intensity value or "average difference" for each transcript represented on the 
array. The algorithms used in these calculations are described in the Affymetrix GeneChip 
Analysis Suite User Guide (Affymetrix). The "average difference" for each transcript was 
normalized to "frequency" values according to the procedures of Hill et al, Science, 290: 
809-812 (2000). This was accomplished by referring the average difference values on each 
chip to a calibration curve constructed from the average difference values for the 1 1 control 
transcripts with known abundance that were spiked into each hybridization solution. This 
process also served to normalize between arrays. 

[0464] Specific transcripts were evaluated further if they met the following criteria. 

First, genes that were designated "absent" by the GENECHIP 3.2 software in all samples 
were excluded from the analysis. Second, in comparisons of transcript levels between 
arrays, a gene was required to be present in at least one of the arrays. Third, for 
comparisons of transcript levels between groups, an ANOVA was applied to identify a 
subset of transcripts that had a significant (p < 0.05) differences in frequency values. . 
10465] A ANOVA was used to compare PBMC expression profiles measured 

immediately before the first infusion of CCI-779, to PBMC expression profiles measured at 
eight and sixteen weeks after the first infusion. In the comparisons, a p value < 0.05 was 
used to indicate statistical significance. Transcripts that were altered, on average, by 2-fold 
or more between any two time points can be selected. 

Exam ple 4. E x Vivo Assays 

i 

[0466] Drug modulable transcripts in the surrogate tissue, such as peripheral blood, 

can provide early evidence of drug exposure in vivo. PBMC ex vivo assays that mimic 
peripheral blood gene expression in disease states as well as the in vivo response to drug 
treatments can further validate the surrogate markers identified in in vivo studies. In 
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addition, ex vivo assays that overlay with disease-associated biomarkers may provide 
evidence for markers of drug-dependent disease amelioration. 

[0467] Many factors should be considered in designing suitable ex vivo assays. 

These factors include, but are not limited to, PBMC seeding density, culture media and 
other conditions, vehicle control, dose response, time course, and dose schedule (e.g., 
preincubation, coincubation, or postincubation). 

[0468] In one embodiment, phytohemagglutinin (PHA) was used to stimulate gene 

expression in PBMCs. The genes whose expression can be stimulated by PHA in PBMCs 
had a substantial overlap with disease-associated transcripts in RCC PBMCs that appear to 
result from T-cell activation. In some cases, PHA-stimulated PBMCs mimicked anti- 
CD3/anti-CD28 stimulated T-cell profiles in BioExpress. In addition, CCI-779 inhibited 
certain PHA-inducible transcripts in a dose-dependent manner. The inhibition by CCI-779 
appeared to be specific, rather than global. In one example, CTSL, SCYA2, FABP5, 
SCYA7, ATP2B1, and IL1R1 genes were directly repressed by 300 ran CCI-779 in PHA- 
stimulated PBMCs in an ex vivo assay. Databases of disease-associated profiles and ex vivo 
activation signatures can be created to identity most appropriate culture conditions for 
identification of drug modulated biomarkers in specific disease settings. 
[0469] The foregoing description of the present invention provides illustration and 

description, but is not intended to be exhaustive or to limit the invention to the precise one 
disclosed Modifications and variations are possible consistent with the above teachings or 
may be acquired from practice of the invention. Thus, it is noted that the scope of the 
invention is defined by the claims and their equivalents. 
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What is claimed is: 

1 . A method comprising comparing an expression profile of at least one gene in 
a peripheral blood sample of a patient to a reference expression profile of said at least one 
gene, wherein said at least one gene is differentially expressed in peripheral blood 
mononuclear cells (PBMCs) of patients who have a non-blood disease and are subject to a 
drug therapy as compared to PBMCs isolated from said patients before said drug therapy, 
and wherein the patient has the non-blood disease and is being treated by said drug therapy. 

2. The method according to claim 1, wherein said drug therapy is a CCI-779 
therapy. 

3. The method according to claim 2, wherein the non-blood disease is a solid 

tumor. 

4. The method according to claim 3, wherein the solid tumor is RCC. 

5. The method according to claim 2, wherein said at least one gene includes one 
or more genes selected from Table 5. 

6. The method according to claim 2, wherein said at least one gene includes at 
least two genes selected from Table 5. 

7. The method according to claim 1, wherein the peripheral blood sample is a 
whole blood sample. 

8. The method according to claim I, wherein the peripheral blood sample 
comprises enriched PBMCs. 

9. The method according to claim 1, wherein the expression profile is 
determined by RT-PCR or immunoassays. 
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10. The method according to claim 1, wherein the reference expression profile is 
an average expression profile of said at least one gene in peripheral blood samples isolated 
from said patients before said drug therapy. 

1 1 . The method according to claim 1 , wherein the reference expression profile is 
an expression profile of said at least one gene in a reference peripheral blood sample 
isolated from the patient before said drug therapy. 

12. The method according to claim 1, wherein said at least one gene includes one 
or more genes which are over-expressed or under-expressed in PBMCs of patients who have 
the non-blood disease as compared to PBMCs of humans who do not have the non-blood 
disease, and wherein said drug therapy is capable of down-regulating or up-regulating 
expression of said one or more genes in PBMCs of patients who have the non-blood 
disease. 

13. The method according to claim 1, wherein said at least one gene includes one 
or more genes whose expression in PBMCs is capable of being increased or reduced by a 
phytohemagglutinin treatment, and wherein said drug therapy is capable of down-regulating 
or up-regulating expression of said one or more genes in phytohemagglutinin-treated 
PBMCs. 

14. The method according to claim 1, wherein RNA transcripts of said at least 
one gene are capable of hybridizing under stringent or nucleic acid array hybridization 
conditions to one or more qualifiers selected from the Qualifier Table. 

15. The method according to claim 14, wherein said drug therapy is a CCI-779 
therapy. 

16. The method according to claim 15, wherein the non-blood disease is RCC. 

17. A method for identifying drug activity genes, comprising: 

detecting an expression profile of genes in peripheral blood samples of 
patients who have a non-blood disease and are subject to a drug therapy; and 
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comparing said expression profile to a baseline expression profile of said 
genes in peripheral blood samples isdated from said patients before said drug therapy so as 
to identify drug activity genes whose expression levels in peripheral blood samples can be 
modulated by said drug therapy. 

18. A kit comprising a plurality of polynucleotides, wherein each of said 
polynucleotides is capable of hybridizing under stringent or nucleic acid array hybridization 
conditions to an RNA transcript, or the complement thereof, of a different respective gene 
selected from Table 5. 

19. A kit comprising a plurality of antibodies, wherein each of said antibodies is 
capable of binding to a polypeptide encoded by a different respective gene selected from 
Table 5. 

20. A nucleic acid array comprising polynucleotide probes, wherein a substantial 
portion of all polypeptide probes on the nucleic acid array can hybridize under stringent or 
nucleic acid array hybridization conditions to RNA transcripts, or the complements thereof, 
of genes selected from Table 5. 
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Present claim 19 relates to an extremely large number of possible kits : .v 
which comprises a plurality of antibodies. In fact, the claim contains 
so many possible options that a lack of clarity and conciseness within 
the meaning of Article 6 PCT arises to such an extent as to render a 
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matter of claim 6 is considered that unclear and inconcise that a 
meaningful search was not possible for any method of claim 6 which 
relates to at least two genes wherein one of the genes is the 
DKFZP564E1962 gene. 
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not been searched. This is the case irrespective of whether or not the 
claims are amended following receipt of the search report or during any 
Chapter II procedure. If the application proceeds into the regional phase 
before the EPO, the applicant 1s reminded that a search may be carried 
out during examination before the EPO (see EPO Guideline C-VI, 8.5), 
should the problems which led to the Article 17(2) declaration be 
overcome . 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

Invention 1 (claims 1,2,5,6 (all partially); 3,4,7-11 (all 
completely)) : 

A method comprising comparing an expression profile of at 
least one gene in a peripheral blood sample of a patient to 
a reference profile of said gene which is differentially 
expressed in PBMCs of patients who have a non-blood disease 
and are subject to a drug therapy as compared to PBMCs 
isolated from said patients before said drug therapy and 
wherein the patient has the non-blood disease and is being 
treated by. sai d. drug .therapy and wherein in .particular tne t . 
at least one gene is DKFZP564E1962, the non-blood disease is 
RCC and the drug is CCI-779. 

Inventions 2-312 (claims 1,2,5,6 (all partially)): 

A method comprising comparing an expression profile of at 
least one gene in a peripheral blood sample of a patient to 
a reference profile of said gene, wherein the at least one 
gene is ETR101 which is differentially expressed in PBMCs of 
patients who have a non-blood disease and are subject to 
CCI-779 therapy as compared to PBMCs isolated from said 
patients before said drug therapy. 

..ibidem for Inventions 3-312 relating to a different gene 
selected from table 5. 

Inventions 313-348 (claims 1,12 and 13 (all partially)): 

A method according to claim 1, wherein said at least one 
gene is IL1R1 which is differentially expressed in PBMCs of 
patients who have a non-blood disease as compared to PBMCs 
of humans who do not have the non-blood disease, and wherein 
said drug therapy is capable of down-regulating or # 
up-regulating expression of said one or more genes in PBMCs 
of patients who have the non-blood disease. 
..ibidem for each gene listed in table 6. 

Invention 349 (claims 1 and 13 (both partially)): 

A method according to claim 1, wherein said at least one 
gene includes ATP2B1 whose expression in PBMCs is capable of 
being increased or reduced by a PHA treatment, and wherein 
said drug therapy is capable of down regulating or # 
up-regulating expression of said one or more genes in 
phytohemagglutinin-treated PBMCs. 



Invention 350 (claim 17 (completely)): 
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A method of identifying drug activitiy genes, comprising: 
detecting an expression profile of genes in peripheral blood 
samples of patients having a non-blood disease and are 
subject to a drug therapy, and comparing said expression 
profile to a baseline expression profile of said genes in 
peripheral blood samples isolated from said patients before 
said drug therapy so as to identify drug activity genes 
whose expression levels in peripheral blood samples can be 
modulated by said drug therapy. 

« 



Invention 351 (claims 18-20 (all completely)) 

.A. kit comprising a plurality., of. polynucleotides,, wherein - 
each of said polynucleotides is capable of hybridizing under 
stringent or nucleic acid array hybridization conditions to 
an RNA transcript, or the complement thereof, of a different 
respective gene selected from table 5; a kit comprising a 
plurality of antibodies, wherein each of said antibodies is 
capable of binding to a polypeptide encoded by a different 
respective gene selected from table 5; a nucleic acid array 
comprising polynucleotide probes, wherein a substantial 
portion of all polypeptide probes on the nucleic acid array 
can hybridize under stringent or nucleic acid conditions to 
RNA transcripts, or the complements thereof, of genes 
selected from table 5. 
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